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(57) Abstract 

Genetic polymorphisms are identified in the human UGT1 gene that alter UGT1 -dependent drag metabolism. Nucleic acids comprising 
the polymorphic sequences are used to screen patients for altered metabolism for UGT1 substrates, potential drug-drug interactions, and 
adverse/side effects, as well as diseases that result from environmental or occupational exposure to toxins. The nucleic acids are used to 
establish animal, cell and in vitro models for drug metabolism. 
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GENOTYPING THE 
HUMAN UDP-GLUCURONOSYLTRANSFERASE 1 (UGT1) GENE 

Introduction 

5 The metabolic processes commonly involved in the biotransformation of xenobiotics 

have been classified into functionalization reactions (phase I reactions), in which lipophilic 
compounds are modified via monooxygenation, dealkylation, reduction, aromatizatron, or 
hydrolysis. These modified molecules can then be substrates for the phase II reactions, 
often called conjugation reactions, as they conjugate a functional group with a polar, 

10 endogenous compound. Drug glucuronidation, a major phase II conjugation reaction in the 
mammalian detoxification system, is catalyzed by the UDP-glucuronosyltransferases 
(UGTs) (Batt AM, et al. (1994) Clin Chim Acta 226:171-190; Burchell et al. (1995) Life Sci, 
57:1819-31). 

The UGTs are a family of enzymes that catalyze the glucuronic acid conjugation of 
15 a wide range of endogenous and exogenous substrates including phenols, alcohols, 

amines and fatty acids. The reactions catalyzed by UGTs permit the conversion of a large 
range of toxic endogenous/xenobiotic compounds to more water-soluble forms for 
subsequent excretion (Parkinson A (1 996) Toxicol Pathol 24:48-57). 

The UGT isoenzymes are located primarily in hepatic endoplasmic reticulum and 
20 nuclear envelope (Parkinson A (1 996) Toxicol Pathol 24:48-57), though they are also 
expressed in other tissues such as kidney and skin. UGTs are encoded by a large 
multigene superfamily that has evolved to produce catalysts with differing but overlapping 
substrate specificities. Three families, UGT1, UGT2, and UGT8, have been identified 
within the superfamily. UGTs are assigned to one of the subfamilies based on amino acid 
25 sequence identity, e.g., UGT1 family members have greater than 45% amino acid 
sequence identity (Mackenzie et al. 1997) Pharmacogenetics 7:255-69). 

The UGT1 locus is located on chromosome 2q37, and contains at least 12 
promoters/first exons, which are apparently able to splice with common exons 2 through 5, 
producing gene products having strikingly different N-terminal halves (amino acid sequence 
30 identities ranging from 24% to 49%), but identical C-terminal halves (Fig. 1). At least eight 
different isoenzymes are encoded by the UGT1 locus; at least one or more first exons 
encode pseudogenes. The different N-terminal halves encoded by the first exons confer 
different substrate binding specificities upon the UGT1 isoenzymes, while exons 2-5, which 
are present in all UGT1 isoenzyme mRNAs, encode the UDP-glucuronic acid binding 
35 domain, membrane anchorage site, and ER retention signal that are common to all UGT 
proteins (Ritter et al. (1992) J Biol Chem 267:3257-3261). UGT1 locus isoenzymes are 
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best known for their role in glucoronidation and metabolism of many substrates, including 
bilirubin (1A1, 1D1), planar and non-planar phenols, naphthols (1F1) (Ouzzine M t et a!. 
(1994) ftfr ft pjnchem Bioohvs 310:196-204), anthraquinones, flavones, aliphatic alcohols, 
aromatic carboxylic acids, and steroids (Ebner T, et al. (1993) Prug Metafr Djspos 21:50- 
55). 

In addition to UGT1 exon usage, metabolism of endogenous and exogenous 
substrates can also be affected by competitive binding phenomena. For example, in some 
cases exogenous substrates for the UGT1 enzymes have a higher binding affinity or avidity 
for the enzyme than the endogenous UGT1 substrates. For example, UGT1*1 , the major 
bilirubin-metabolizing form of UGT1, more readily binds both octyl-gallate and emodin than 
it binds bilirubin, thus indicating the potential of these xenobiotics to cause jaundice by 
inhibition of bilirubin binding to UGT1*1 (where 1*1 indicates that the first exon is used in 
the spliced gene product). UGT1*1 is also responsible for glucuronidation of the oral 
contraceptive ethinyiestradiol (Ebner et al. (1993) Mol. Pharmacol, 43:649-54), and can 
1 5 also glucuronidate phenols, anthroquinones, flavones, and certain endogenous steroids. 
As noted above, the first exon present in UGT1 can affect substrate binding 
specificity of the UGT1 gene product (for a review, see Burchell (1995) Life $cl t 57:1819- 
31). For example, UGT1*2 accepts a wide range of compounds as substrates including 
non-planar phenols, anthraquinones, flavones, aliphatic alcohols, aromatic carboxylic acids, 
20 steroids (4-hydroxyestrone, estrone) and many drugs of varied structure (Ebner et al. 
(1993) Hrn g Metab. Disp. 21:50-5; Burchell (1995) Life Sci, 57:1819-31). In contrast, 
UGT1*6 exhibits only limited substrate specificity for planar phenolic compounds relative to 
other human UGTs. 

Polymorphisms can markedly affect binding of the endogenous substrate, which can 
25 be manifested as clinical syndromes. At least two conditions, Crigler-Najjar syndrome and 
Gilbert syndrome, are associated with UGT1 polymorphisms. Both of these syndromes are 
hereditary and are associated with predominantly unconjugated hyperbilirubinemia. Crigler- 
Najjar syndrome is associated with intense, persistent jaundice which begins at birth. 
Some affected infants die in the first weeks or months of life with kemicterus; others survive 

30 with little or no neurologic defect. Crigler-Najjar syndrome is caused by a defect in the 
ability of UGT1 to catalyze UDP-glucuronidation of bilirubin, resulting in accumulation of 
bilirubin in the blood (Erps et al. (1994) J Clin. Invest. 93:564-70). Gilbert syndrome is a 
benign mild form of unconjugated hyperbilirubinemia that is characterized by normal liver 
function tests, normal liver histology, delayed clearance of bilirubin from the blood, and mild 

35 jaundice that tends to fluctuate in severity. As with Crigler-Najjar syndrome, Gilbert 
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syndrome is associated with a defect in UGT1 . Specific UGT polymorphisms that are 
known to be associated with disease are indicated in Fig. 1. 

Alteration of the expression or function of UGTs may also affect drug metabolism. 
For example, there may be common polymorphisms in the human UGT1 gene that alter 
expression or function of the protein product and cause drug exposure-related phenotypes. 
Thus, there is a need in the field to identify UGT1 polymorphisms in order to provide a 
better understanding of drug metabolism and the diagnosis of drug exposure-related 
phenotypes. 

Relevant literature 

Genbank accession number M84122 provides UGT1 exon 2, M84123 provides 
exons 3 and 4, M84124 provides 5, M84125 provides exon 1A, M84127 provides exon 1C, 
M84128 provides exon 1D, M84129 provides exon 1E ( M84130 provides exon IF, U39570 
provides exon 1G, U42604 provides exon 1H, U39550 provides exon 1J. 

The UGT gene superfamily and recommended nomenclature for describing UGT 
genes and alleles are reviewed in Mackenzie et al. (1997) Pharmacogenet . 7:255-69. 

The two UGT1 A6 genetic polymorphisms are described in Ciotti et al. (1997) Any J- 
Hum. Genet. 61(Supp):A249. The identification of Asp446 as a critical residue in UGT1 is 
described in Iwano et al. (1997) Biochem. J. 325:587-91. 

A review of the substrate specificity of human UDP-glucuronosyltransferases is 
provided by Burchell et al. (1995) Life Sci. 57:1819-31. For a review of drug 
glucoronidation in humans, see Miners et al. (1991) Pharmacol. Ther. 51:347-69. 

At least twelve UGT1 A1 polymorphisms have been identified and linked to disease. 
These UGT1A1 alleles, each of described in OMIM Entry 191740 (at 
http://www.ncbi.nlm.nih.gov/htbin-post/Omim/dispmim7191740) and in OMIM Entry 143500 
(at http://www.ncbi.nlm.nih.gov/htbin-post/Omim/dispmim7143500), include: 

1) UGT1*FB (UGT1A1, 13-BP DEL, EX2; 191740.0001), which contains a 13 bp 
deletion in exon 2 and is associated with Crigler-Najjar syndrome type I (CN-I); 

2) UGT1A1, EXON4, C-T, SER-PHE (191740.0002), which contains a C-to-T 
transition in exon 4 (resulting in an amino acid change from serine to phenylalanine) is 
associated with CN-I and deficiency of both bilirubin-UGT and phenol-UGT activities in the 
liver; 

3) UGT1A1, GLN331TER (191740.0003), which contains a C-to-T transition 6 bp 
upstream from the 3-prime end of exon 2 of the common region (replacement of a 
glutamine codon with a stop codon), is associated with CN-I; 
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4) UGT1A1, ARG341TER (191740.0004), which contains a nonsense mutation 

(CGA-to-TGA) in exon 3 and is associated with CN-I and a total absence of all 

phenol/bilirubin UGT proteins and their activities in liver homogenate by enzymologic and 

immunochemical analysis; 
5 5) UGT1A1 , GLN331 ART (191740.0005), which contains an A-to-G transition 5 bp 

upstream of the exon 2/intron 2 boundary (resulting in a glutamine-to-arginine substitution), 

is associated with Crigler-Najjar Syndrome, type II (CN-II); 

6) UGT1A1 , PHE170DEL (191740.0006), which contains a deletion of the 

phenylalanine codon at position 170 in exon 1, and is associated with CN-I; 
10 7) UGT1A1 , SER376PHE (191740.0007), which contains a C-to-T transition in 

codon 376 (resulting in a change of serine to phenylalanine) and is associated with CN-I; 

8) UGT1A1 , GLY309GLU (191740.0008), which contains a G-to-A transition in 
codon 309 (resulting in a glycine to glutamic acid change) and is associated with CN-I; 

9) UGT1A1 , NT840, C-A, CYS-TER (191740.0009), which contains a C-to-A 

15 transversion at base position 840 in exon 1 (resulting in replacing a cysteine with a stop 
codon), is associated with CN-I; 

10) UGT1A1. PRQ229GLN (191740.00010), which contains C-to-A transversion at 
nucleotide 686 (changing proline-229 to glutamine). is associated with Gilbert syndrome; 

11) UGT1A1, 2-BP INS, TA INS, TATAA ELEMENT (191740.00011) contains 2 
20 extra bases (TA) in the TATAA element of the 5-prime promoter region of the gene (where 

normally an A(TA)6TAA element is present between nucleotides -23 and -3) and is 
associated with Gilbert syndrome; and 

12) UGT1A1, 1-BP INS, 470T INS (191740.00012), which contains 470insT 
mutation in exon 1 and is associated with CN-I. 

25 

Summary of the Invention 
Genetic sequence polymorphisms are identified in the UGT1 gene. Nucleic acids 
comprising the polymorphic sequences are used in screening assays, and for genotyping 
individuals. The genotyping information is used to predict an individuals' rate of metabolism 
30 for UGT1 substrates, potential drug-drug interactions, and adverse/side effects. 

Accordingly, in one aspect the invention features an isolated nucleic acid molecule 
comprising a UGT1 sequence polymorphism of SEQ ID NOS:87-124, as part of other than 
a naturally occurring chromosome. In related aspects, the invention features nucleic acid 
probes for detection of UGT1 locus polymorphisms, where the probe comprises a 
35 polymorphic sequence of SEQ ID NOS:87-1 24. 
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In another aspect the invention features an array of oligonucleotides comprising two 
or more probes for detection of UGT1 locus polymorphisms, where the probes comprise at 
least one form of a polymorphic sequences of SEQ ID NOS:87-124. 

In still another aspect, the invention features a method for detecting in an individual 
5 a polymorphism in UGT1 metabolism of a substrate, where the method comprises 

analyzing the genome of the individual for the presence of at least one UGT1 polymorphism 
of SEQ ID NOS:87-124; wherein the presence of the predisposing polymorphism is 
indicative of an alteration in UGT1 expression or activity. 

In one embodiment, the analyzing step of the method is accomplished by detection 
10 of specific binding between the individual's genomic DNA with an array of oligonucleotides 
comprising two or more probes for detection of UGT1 locus polymorphisms, where the 
probes comprise at least one form of a polymorphic sequence of SEQ ID NOS:87-124. 

In other embodiments of the method, the alteration is UGT1 expression or activity is 
tissue specific, or is in response to a UGT1 modifier. The UGT1 modifier may either induce 
1 5 or inhibit UGT1 expression. 

Brief Description of the Drawings 
Fig. 1 is a schematic showing the UGT1 locus. Each of the first exons is denoted by 
both its alphabetic and numerical nomenclatures (e.g., 1A and 1.1). 
20 Fig. 2 is a schematic showing exons 1 A-1 J of the UGT1 locus and the 

polymorphisms described in the present application. 

Fig. 3 is a schematic showing the exons 1 A-1 F, and 2-5 of the UGT1 locus and the 
polymorphisms that have been publicly disclosed. 

25 brief Description of the Sequence Listing 

UGT1 Reference Sequences. SEQ ID NOS: 1, 3, 5, 7, 9, 11. 13, and 15 are the 
UGT1 reference polynucleotide sequences for UGT1 exons 1A. 1C. 1D. 1E. 1F. 1G, 1H. 
and 1 J. The polypeptide sequences are encoded by these reference exon sequences are 
SEQ ID NOS:2, 4. 6, 8, 12, 14, and 16. SEQ ID NOS: 17 and 18 are the reference 
30 polynucleotide and amino acid sequences for UGT1 exons 2-5. 

PCR Primers. The primary and secondary PCR primers for amplification of 
polymorphic sequences are presented as SEQ ID NOS:19-50. 

Sequencing Primers. The primers used in sequencing isolated polymorphic 
sequences are presented as SEQ ID NOS:51-86. 
35 Polymorphisms. Polymorphic sequences of the invention are presented as SEQ ID 

NOS.88-124. 
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Description of the specific embodiments 
Pharmacogenetics is the linkage between an individual's genotype and that 
individual's ability to metabolize or react to a therapeutic agent Differences in metabolism 
or target sensitivity can lead to severe toxicity or therapeutic failure by altering the relation 
5 between bioactive dose and blood concentration of the drug. Relationships between 

polymorphisms in metabolic enzymes or drug targets and both response and toxicity can be 
used to optimize therapeutic dose administration. 

Genetic polymorphisms are identified in the UGT1 gene. Nucleic acids comprising 
the polymorphic sequences are used to screen patients for altered metabolism for UGT1 
10 substrates, potential drug-drug interactions, and adverse/side effects, as well as d.seases 
that result from environmental or occupational exposure to toxins. The nucleic acds are 
used to establish animal, cell culture and in vitro cell-free models for drug metabolism. 



15 



20 



25 



30 



35 



pftfmitions 

It is to be understood that this invention is not limited to the particular methodology, 
protocols, cell lines, animal species or genera, constructs, and reagents described, as such 
may vary It is also to be understood that the terminology used herein is for the purpose of 
describing particular embodiments only, and is not intended to limit the scope of the present 
invention which will be limited only by the appended claims. 

As used herein the singular forms "a", "and", and "the" include plural referents 
unless the context clearly dictates otherwise. Thus, for example, reference to "a construct" 
includes a plurality of such constructs and reference to "the UGT1 nucleic acid" includes 
reference to one or more nucleic acids and equivalents thereof known to those skilled .n the 
art and so forth. All technical and scientific terms used herein have the same meamng as 
commonly understood to one of ordinary skill in the art to which this invention belongs 
unless clearly indicated otherwise. 

UGT1 polymorphic sequences. The sequence of the UGT1 gene is known in the 
art, and accessible in public databases, as cited above. This sequence is useful as a 
reference for the genomic location of the human gene, and for specific coding region 
sequences. As used herein, the term "UGT1 gene" is intended to refer to both the w,ld-ty P e 
and variant sequences, unless, specifically denoted otherwise. Nucleic acids of particular 
interest comprise the provided variant nucleotide sequence(s). For screening purposes, 
hybridization probes may be used where both polymorphic forms are present, either .n 
separate reactions, or labeled such that they can be distinguished from each other. Assays 
may utilize nucleic acids that hybridize to one or more of the described polymorphs. 
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The genomic UGT1 sequence is of particular interest. A polymorphic UGT1 gene 
sequence, i.e. including one or more of the provided polymorphisms, is useful for 
expression studies to determine the effect of the polymorphisms on enzymatic activity. The 
polymorphisms are also used as single nucleotide polymorphisms to detect genetic 
5 association with phenotypic variation in UGT1 activity and expression. 

The UGT1 exon structure is illustrated in Fig. 1. The UGT1 locus contains at least 
12 promoters/first exons, which are apparently able to splice with common exons 2 through 
5, producing gene products having different N-terminal halves but identical C-terminal 
halves. The first exon utilized at least in part determines the substrate specificity of the 
10 resulting UGT1 gene product. Each of the first exons in Fig. 1 is denoted by both its 
alphabetic and numerical nomenclatures (e.g., 1A and 1.1). Polymorphisms in the UGT1 
first exon can be associated with alteration of substrate binding specificity and/or disease. 
Fig. 2 shows UGT1 exons 1A-1J and the polymorphisms described in the present 
application. Fig. 3 shows UGT1 exons 1 A-1F and 2-5 and the polymorphisms in these 
1 5 exons that have been publicly disclosed. Polymorphisms denoted by an asterisk (*) have 
been assigned the indicated "allele name" (e.g., *12). The specific associated disease is 
indicated below in parentheses for several of these disease-associated polymorphisms. 
Except for the "mutation" that is associated with Gilbert's (*28, which is not universally 
agreed upon in the literature), all mutations in exons 1D. 1A, and 2-5 were isolated from 

20 individuals with disease. 

Fragments of the DNA sequence are obtained by chemically synthesizing 
oligonucleotides in accordance with conventional methods, by restriction enzyme digestion, 
by PCR amplification, etc. For the most part. DNA fragments will be of at least 1 5 nt, 
usually at least 20 nt. often at least 50 nt. Such small DNA fragments are useful as primers 

25 for PCR, hybridization screening, etc. Larger DNA fragments, i.e. greater than 100 nt are 
useful for production of the encoded polypeptide, promoter motifs, etc. For use in 
amplification reactions, such as PCR, a pair of primers will be used. The exact composition 
of primer sequences is not critical to the invention, but for most applications the primers will 
hybridize to the subject sequence under stringent conditions, as known in the art. 
30 The UGT1 nucleic acid sequences are isolated and obtained in substantial purity, 

generally as other than an intact mammalian chromosome. Usually, the DNA will be 
obtained substantially free of other nucleic acid sequences that do not include a UGT1 
sequence or fragment thereof , generally being at least about 50%, usually at least about 
90% pure and are typically "recombinant". i.e. flanked by one or more nucleotides with 
35 which it is not normally associated on a naturally occurring chromosome. 
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UGT1 polypeptides. The UGT1 genetic sequence, including polymorphisms, may 
be employed for synthesis of a complete UGT1 protein, or polypeptide fragments thereof, 
particularly fragments corresponding to functional domains; binding sites; etc.; and 
including fusions of the subject polypeptides to other proteins or parts thereof. For 
5 expression, an expression cassette may be employed, providing for a transcriptional and 
translational initiation region, which may be inducible or constitutive, where the coding 
region is operably finked under the transcriptional control of the transcriptional initiation 
region, and a transcriptional and translational termination region. Various transcriptional 
initiation regions may be employed that are functional in the expression host. The 
10 polypeptides may be expressed in prokaryotes or eukaryotes in accordance with 

conventional ways, depending upon the purpose for expression. Small peptides can also 
be synthesized in the laboratory. 

Substrate. A substrate is a chemical entity that is modified by UGT1 , usually under 
normal physiological conditions. Although the duration of drug action tends to be shortened 
1 5 by metabolic transformation, drug metabolism is not 'detoxification". Frequently the 
metabolic product has greater biologic activity than the drug itself. In some cases the 
desirable pharmacologic actions are entirely attributable to metabolites, the administered 
drugs themselves being inert. Likewise, the toxic side effects of some drugs may be due in 
whole or in part to metabolic products. 
20 Substrates can be either endogenous substrates (e.g., substrates normally found 

within the natural environment of UGT1. such as the bilirubin or other endobiotic 
compound) or exogenous (e.g.. substrates that are not normally found within the natural 
environment of UGT1 . such as ethinyl estradiol or other xenobiotic compound). Exemplary 
UGT1 substrates {i.e., substrates of wild-type UGT1 and/or UGT1 polypeptides encoded by 
25 UGT1 polymorphisms) include, but are not necessarily limited to endobiotics such as 
bilirubin, bilirubin monoglucoronide, bile acids, steroids, thyroxine, biogenic amines, fat- 
soluble vitamins, UDPGA, 17p estradiol, estriol, 2-hydroxy-estriol, T4,rT3. and the like; and 
xenobiotics such as hydroxylated polycyclic aromatic hydrocarbons, heterocyclics, 
carcinogens, plant metabolites, octyl gallate. ethinytestradiol, anthraflavic acid, quercetin, 1- 
30 naphthol. naphthylamines, 4-aminobiphenyl, benzidine, imipamine, BP-3.6-quinol. 5- 

hydroxy-BP. acetaminophen, vanillin, naproxen, 4-methylumbelliferone, monohalogenated 
phenols, propofol, 4t-pentylphenol. 4-hydroxybiphenyl. carvacrol, emodin, galangin, bulky 
phenols, carboxylic acids. 5-hydroxy 2AAF. 8-hydroxy 2AAF, and the like. Table 1 provides 
a summary of the major endobiotic and xenobiotic substrates, as well as exemplary non- 
35 substrates, of four UGT1 isoenzymes (UGT1*1 (same as UGT1A), UGT1M (same as 
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UGT1 D) y UGT1*6 (same as UGT1F), and UGT1*02 (same as UGT1G) (see Burchell et al. 
91995) \MS£L 57:1819-31). 



Table 1 Substrate Specificity of Human Liver UGT1 Isoenzymes 



Isoenzyme 



Endobiotic 



Xenobiotic 



Non-substrate 



UGT1*1 



UGTI*4 



Bilirubin (Km 24 um) 
Bilirubin monoglucuronide 
UDPGA(Km0.41mM) 
17(3 estradiol 
Estriol 

2-hydrozy-estriol 
T4,rT3 



Ocryl gallate (Km 162 um) 

Ethinylestradiol 

Anthraflavic acid 

Quercetin 

1-naphthol 



Gallic acid 
T3 

Menthol 

Retinoic acid 

Clofibrate 

Morphine 

Propofol 

Testosterone 



Bilirubin? 



Naphthylamines 
4-aminobiphenyl 
Benzide 
Imipamine 



Bilirubin? 
Carbarn azepme 



UGT1*6 



1-Naphthol 

BP-3,6-quinol 

5-hydroxy-BP 

Acetaminophen (Km 2 mM) 

Vanillin 

Naproxen 

4-methylumbelliferone 
Monohalogenated phenols 



4-Hydroxybiphenyl 

Propofol 

Galangin 

Emodtn 

Morphine 

Estriol 

Estradiol 

AZT 

Menthol 



UGT1*7 



UDPGA (Km 0.41 mM) 
T4,rT3 



Propofol (Km 172 um) 
4t-pentylphenol 

4- hydroxybiphenyl 
Carvacrol 
Emodin 
Galangin 

Octyl gallate (Km 1 58 uM) 
Other bulky phenols 
Acetaminophen (Km 50 mM) 
Carboxylic acids (some) 

5- hydroxy 2AAF 
8-hydroxy2AAF 



Morphine 

Estriol 

Estradiol 

AZT 

Menthol 

Chloramphenicol 

Androsterone 

T3 



10 



15 



Modifier. A modifier is a chemical agent that modulates the action of UGT1 , either 
through altering its enzymatic activity (enzymatic modifier) or through modulation of 
expression (expression modifier; e.g., by affecting transcription or translation). In some 
cases the modifier may also be a substrate. For example, the UGT1 gene contains an 
electrophile responsive element (USPN 5,589,504); thus, compounds such as metabolites 
of planar aromirtic compounds and phenolic antioxidants, as well as reactive oxygen 
species including peroxides would be expression modifiers via their effect on the 
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ele^e ^sponsive element Endogenous and exogenous inducers that are capable of 
inducing particular UGT activities include phenobarbital, dioxin, peroxisome prolrferators, 
rifamicin, oral contraceptive drug, carbamazepine, cigarette smoke, cabbage, brussel 
sprouts polycyclic/aromatic hydrocarbons, and derivatives of indole 3-carbon.l (see 
Burchell et al. (1995). supra, Parkinson In: 'Biotransformation of Xenobiotics." Chapter 6. 
^.p^PflMir* Toxicology. 5* Ed., C. Klaassen, ed.)). 

Pharmacokinetic parameters. Pharmacokinetic parameters provide fundamental 
data for designing safe and effective dosage regimens. A drug's volume of distribute, 
clearance and the derived parameter, half-life, are particularly important, as they determme 
the degree of fluctuation between a maximum and minimum plasma concentration dunng a 
dosage interval, the magnitude of steady state concentration and the time to reach steady 
state plasma concentration upon chronic dosing. Parameters derived from in wo drug 
administration are useful in determining the clinical effect of a particular UGT1 genotype. 

Expression assay. An assay to determine the effect of a sequence polymorphs 
on UGT1 expression. Expression assays may be performed in cell-free extracts, or by 
transforming cells with a suitable vector. Alterations in expression may occur in the basal 
,eve. that is expressed in one or more cel. types, or in the effect that an expression modtfer 
has on the ability of the gene to be inhibited or induced. Expression levels of a vanant 
alleles are compared by various methods known in the art. Methods for determ.n,ng 
promoter or enhancer strength include quantitation of the expressed natural protem; 
insertion of the variant control element into a vector with a reporter gene such as 
p-galactosidase. luciferase, chloramphenicol acetyltransferase. etc. that provides for 
convenient quantitation; and the like. 

Gel shift or electrophoretic mobility shift assay provides a simple and rapid method 
for detecting DNA-binding proteins (Ausubel. F.M. et a/. (1989) In: Current Protocols .n 
Molecular Biology, VoL 2, John Wiley and Sons. New York). This method has been used 
widely in the study of sequence-specific DNA-binding proteins, such as transcnpfon 
factors The assay is based on the obsen^ation that complexes of protein and DNA m.grate 
throughanondenaturingpolyacrylamidegelmoreslowlythanfreeDNAfragmentsor 

double-stranded oligonucleotides. The gel shrft assay is performed by incubating a punfied 
protein or a complex mixture of proteins (such as nuclear or cell extract preparations), wrth 
an end-labeled DNA fragment containing the putative protein binding site. The react™ 
products are then analyzed on a nondenaturing polyacrylamide gel. The spedficrty of the 
DNA-binding protein for the putative binding site is established by competition expenments 
35 using DNA fragments or oligonucleotides containing a binding site for the protem of 
interest, or other unrelated DNA sequences. 



30 
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Expression assays can be used to detect differences in expression of 
polymorphisms with respect to tissue specificity, expression level, or expression in 
response to exposure to various substrates, and/or timing of expression during 
development. For example, since UGT1A and UGT1E are expressed in liver, UGT1A and 
5 UGT1 E polymorphisms could be evaluated for expression in tissues other than liver, or 
expression in liver tissue relative to a reference UGT1A or UGT1E polypeptide. Similarly, 
expression of polymorphisms in UGT1F. which is normally expressed in liver, kidney and 
skin, could be assayed in each of these tissues and the relative levels of expression 
compared to a reference UGT1F polypeptide. 
10 Suosfrafe screening assay. Substrate screening assays are used to determine the 

metabolic activity of a UGT1 protein or peptide fragment on a substrate. Many suitable 
assays are known in the art. including the use of primary or cultured cells, genetically 
modified cells (e.g., where DNA encoding the UGT1 polymorphism to be studied is 
introduced into the cell within an artificial construct), cell-free systems, e.g. microsomal 
1 5 preparations or recombinantly produced enzymes in a suitable buffer, or in animals, 
including human clinical trials (see. e.g., Burchell et al. (1995) Lif&SsL 57:1819-1831, 
specifically incorporated herein by reference. Where genetically modified cells are used, 
since most cell lines do not express UGT1 activity (liver cells lines being the exception), 
introduction of artificial construct for expression of the UGT1 polymorphism into many 
20 human and non-human cell lines does not require additional modification of the host to 
inactivate endogenous UGT1 expression/activity. Clinical trials may monitor serum, urine, 
efc. levels of the substrate or its metabolite(s). 

Typically a candidate substrate is input into the assay system, and the conversion to 
a metabolite is measured over time. The choice of detection system is determined by the 
25 substrate and the specific assay parameters. Assays are conventionally run. and will 

include negative and positive controls, varying concentrations of substrate and enzyme, etc. 

Genotyping: UGT1 genotyping is performed by DNA or RNA sequence and/or 
hybridization analysis of any convenient sample from a patient, e.g. biopsy material, blood 

30 sample (serum, plasma, etc.). buccal cell sample, etc. A nucleic acid sample from an 
individual is analyzed for the presence of polymorphisms in UGT1 , particularly those that 
affect the activity or expression of UGT1 . Specific sequences of interest include any 
polymorphism that leads to changes in basal expression in one or more tissues, to changes 
in the modulation of UGT1 expression by modifiers, or alterations in UGT1 substrate 

35 specificity and/or activity. 
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Linkage Analysis: Diagnostic screening may be performed for polymorphisms that 
are genetically linked to a phenotypic variant in UGT1 activity or expression, particularly 
through the use of microsatellite markers or single nucleotide polymorphisms (SNP). The 
microsatellite or SNP polymorphism itself may not phenotypically expressed, but is linked to 
5 sequences that result in altered activity or expression. Two polymorphic variants may be in 
linkage disequilibrium, i.e. where alleles show non-random associations between genes 
even though individual loci are in Hardy-Weinberg equilibrium. 

Linkage analysis may be performed alone, or in combination with direct detection of 
phenotypically evident polymorphisms. The use of microsatellite markers for genotyping is 
10 well documented. For examples, see Mansfield et al. (1994) Genomics 24:225-233; and 
Ziegle et al. (1992) Genomics 14:1026-1031. The use of SNPs for genotyping is illustrated 
in Underbill et al. (1996) Proc Natl Acad SciUSA 93:196-200. 

Transgenic animafs. The subject nucleic acids can be used to generate genetically 
15 modified non-human animals or site specific gene modifications in cell lines. The term 
"transgenic" is intended to encompass genetically modified animals having a deletion or 
other knock-out of UGT1 gene activity, having an exogenous UGT1 gene that is stably 
transmitted in the host cells, or having an exogenous UGT1 promoter operably linked to a 
reporter gene. Transgenic animals may be made through homologous recombination, 
20 where the UGT1 locus is altered. Alternatively, a nucleic acid construct is randomly 

integrated into the genome. Vectors for stable integration include plasmids, retroviruses 
and other animal viruses, YACs, and the like. Of interest are transgenic mammals, e.g. 
cows, pigs, goats, horses, etc., and particularly rodents, e.g. rats, mice, etc. 

25 Genetically Modified Cells. Primary or cloned cells and cell lines are modified by the 

introduction of vectors comprising UGT1 gene polymorphisms. The gene may comprise 
one or more variant sequences, preferably a haplotype of commonly occurring 
combinations. In one embodiment of the invention, a panel of two or more genetically 
modified cell lines, each cell line comprising a UGT2B4 polymorphism, are provided for 

30 substrate and/or expression assays. The panel may further comprise cells genetically 
modified with other genetic sequences, including polymorphisms, particularly other 
sequences of interest for pharmacogenetic screening, e.g. UGT1, other UGT2 sequences, 
cytochrome oxidase polymorphisms, etc. 

Vectors useful for introduction of the gene include plasmids and viral vectors, e.g. 

35 retroviral-based vectors, adenovirus vectors, eto. that are maintained transiently or stably in 
mammalian cells. A wide variety of vectors can be employed for transfection and/or 
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integration of the gene into the genome of the cells. Alternatively, micro-injection may be 
employed, fusion, or the like for introduction of genes into a suitable host cell. 



Genotvping Methods 

5 The effect of a polymorphism in the UGT1 gene sequence on the response to a 

particular substrate or modifier of UGT1 is determined by in vitro or in vivo assays. Such 
assays may include monitoring the metabolism of a substrate during clinical trials to 
determine the UGT1 enzymatic activity, specificity or expression level. Generally, in vitro 
assays are useful in determining the direct effect of a particular polymorphism, while clinical 

10 studies will also detect an enzyme phenotype that is genetically linked to a polymorphism. 

The response of an individual to the substrate or modifier can then be predicted by 
determining the UGT1 genotype, with respect to the polymorphism. Where there is a 
differential distribution of a polymorphism by racial background, guidelines for drug 
administration can be generally tailored to a particular ethnic group. 

15 The basal expression level in different tissue may be determined by analysis of 

tissue samples from individuals typed for the presence or absence of a specific 
polymorphism. Any convenient method may be use, e.g. ELISA, RIA, etc. for protein 
quantitation, northern blot or other hybridization analysis, quantitative RT-PCR, etc. for 
mRNA quantitation. The tissue specific expression is correlated with the genotype. 

20 The alteration of UGT1 expression in response to a modifier is determined by 

administering or combining the candidate modifier with an expression system, e.g. animal, 
ceil, in vitro transcription assay, etc. The effect of the modifier on UGT1 transcription 
and/or steady state mRNA levels is determined. As with the basal expression levels, tissue 
specific interactions are of interest. Correlations are made between the ability of an 

25 expression modifier to affect UGT1 activity, and the presence of the provided 

polymorphisms. A panel of different modifiers, cell types, etc. may be screened in order to 
determine the effect under a number of different conditions. 

A UGT1 polymorphism that results in altered enzyme activity or specificity is 
determined by a variety of assays known in the art. The enzyme may be tested for 

30 metabolism of a substrate in vitro, for example in defined buffer, or in cell or subcellular 
lysates, where the ability of a substrate to be metabolized by UGT1 under physiologic 
conditions is determined. Where there are not significant issues of toxicity from the 
substrate or metaboltte(s), in vivo human trials may be utilized, as previously described. 

The genotype of an individual is determined with respect to the provided UGT1 gene 

35 polymorphisms. The genotype is useful for determining the presence of a phenotypically 
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evident polymorphism, and for determining the linkage of a polymorphism to phenotypic 
change. 

A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence. Where large amounts of DNA are available, genomic DNA is used 
directly. Alternatively, the region of interest is cloned into a suitable vector and grown in 
sufficient quantity for analysis. The nucleic acid may be amplified by conventional 
techniques, such as the polymerase chain reaction (PCR), to provide sufficient amounts for 
analysis. The use of the polymerase chain reaction is described in Saiki et aL (1985) 
Science 230:1350-1354, and a review of current techniques may be found in Sambrook et 
al. Molecular Cloning: A Laboratory Manual, CSH Press 1989, pp.14.2-14.33. 
Amplification may be used to determine whether a polymorphism is present, by using a 
primer that is specific for the polymorphism. Alternatively, various methods are known in 
the art that utilize oligonucleotide ligation as a means of detecting polymorphisms, for 
examples see Riley et al. (1990) Nucleic Acids Res 18:2887-2890; and Delahunty et al. 
(1996) Am J Hum Genet 58:1239-1246. 

A detectable label may be included in an amplification reaction. Suitable labels 
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), ZJ'-dimethoxy-^S 1 - 
dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy-2\4 , ,7\4,7- 
hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N.N.N'.N'-tetramethyl-e- 
carboxyrhodamine (TAMRA), radioactive labels, e.g. 32P, 35S, 3H; etc. The label may be 
a two stage system, where the amplified DNA is conjugated to biotin, haptens, etc. having a 
high affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner 
is conjugated to a detectable label. The label may be conjugated to one or both of the 
primers. Alternatively, the pool of nucleotides used in the amplification is labeled, so as to 
incorporate the label into the amplification product. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art The nucleic acid may be sequenced by dideoxy or 
other methods. Hybridization with the variant sequence may also be used to determine its 
presence, by Southern blots, dot blots, efc. The hybridization pattern of a control and 
variant sequence to an array of oligonucleotide probes immobilized on a solid support, as 
described in U.S. 5,445,934, or in WO95/35505, may also be used as a means of detecting 
the presence of variant sequences. Single strand conformational polymorphism (SSCP) 
analysis, denaturing gradient gel electrophoresis (DGGE), mismatch cleavage detection, 
and heteroduplex analysis in gel matrices are used to detect conformational changes 
created by DNA sequence variation as alterations in electrophoretic mobility. Alternatively, 
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where a polymorphism creates or destroys a recognition site for a restriction endonuciease 
(restriction fragment length polymorphism, RFLP) t the sample is digested with that 
endonuciease, and the products size fractionated to determine whether the fragment was 
digested. Fractionation is performed by gel or capillary electrophoresis, particularly 
acrylamide or agarose gels. 

In one embodiment of the invention, an array of oligonucleotides are provided, 
where discrete positions on the array are complementary to one or more of the provided 
polymorphic sequences, e.g. oligonucleotides of at least 12 nt, frequently 20 nt, or larger, 
and including the sequence flanking the polymorphic position. Such an array may comprise 
a series of oligonucleotides, each of which can specifically hybridize to a different 
polymorphisms For examples of arrays, see Hacia et al. (1996) Nat Genet 14:441-447 and 
DeRisi et al. (1996) Nat Genet 14:457-460. Arrays of interest may further comprise 
sequences, including polymorphisms, of other genetic sequences, particularly other 
sequences of interest for pharmacogenetic screening, e.g. UGT1, other UGT2 sequences, 
cytochrome oxidase polymorphisms, eta 

The genotype information is used to predict the response of the individual to a 
particular UGT1 substrate or modifier. Where an expression modifier inhibits UGT1 
expression, then drugs that are a UGT1 substrate will be metabolized more slowly if the 
modifier is co-administered. Where an expression modifier induces UGT1 expression, a 
co-administered substrate will typically be metabolized more rapidly. Similarly, changes in 
UGT1 activity will affect the metabolism of an administered drug. The pharmacokinetic 
effect of the interaction will depend on the metabolite that is produced, e.g. a prodrug is 
metabolized to an active form, a drug is metabolized to an inactive form, an environmental 
compound is metabolized to a toxin, etc. Consideration is given to the route of 
administration, drug-drug interactions, drug dosage, etc. 

Examples 

The following examples are put forth so as to provide those of ordinary skill in the 
art with a complete disclosure and description of how to make and use the subject 
invention, and are not intended to limit the scope of what is regarded as the invention. 
Efforts have been made to ensure accuracy with respect to the numbers used (e.g., 
amounts, temperature, concentrations, etc.) but some experimental errors and deviations 
should be allowed for. Unless otherwise indicated, parts are parts by weight, molecular 
weight is average molecular weight, temperature is in degrees centigrade; and pressure is 
at or near atmospheric. 
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Example: identification of UGT1 Polymorphisms 
MATERIALS AND METHODS 

DNA Samples. Blood specimens were collected from approximately 48 individuals 
after obtaining informed consent. All samples were stripped of personal identifiers to 
maintain confidentiality. Genomic DNA was isolated from these samples using standard 
techniques. Genomic DNA was stored either as a concentrated solution, or in a dried form 
in microtiter plates. 

PCR amplifications. The primers used to amplify all exons are shown in Table 2, and 

were designed with NBI's Oligo version 5.0 program. 

Table 2. PCR Primers. (Ex = Exon) 
PRIMARY PCR AMPLIFICATION 



EX 


FORWARD PRIMER 


REVERSE PRIMER 


IA 


TGGTGTATCG ATTGGTTTT (SEQ ID 


CATATATCTGGGGCTAGTTAATC (SEQ ID 




NO: 19) 


NO:20) 


1C 


ACAAGGTAATTAAGATGAAGAAAGCA 


ACCTGAGATAGTGGCTTCCT (SEQ ID 




(SEQIDNO:21) 


NO:22) 


ID 


TTTGTCTTCC A ATTA C ATGC (SEQ ID 


AGTAGATATGGAAGCACTTGTAAG (SEQ 




NO;23) 


ID NO:24) 


IE 


TCTCAGTGACAAGGTAATTAAGAC(SE 


CATTGATTGGATAAAGGCA (SEQ ID NO:26) 




QIDNO:25) 




IF 


AATTTGGGTTCTTACATATCAA(SEQ ID 


GAGTGAGGGAGGACAGAG (SEQ ID NO:28) 




NO:27) 




1G 


ATAAGTACACGCCTTCTTTTG (SEQ ID 


GCTGCTTTATACAATTTGCTAC (SEQ ID 




NO:29) 


NO:30) 


1H 


CGCCTACGTATCATAGCAGTTA(SEQ ID 


GGAAAGAA ATTTGAAATGCAAC (SEQ ID 




NO:31) 


NO:32) 


1J 


TCTTTCCGCCTACTGTATCA (SEQ ID 


TTCAAGAAGGGCAGTTTTAT (SEQ ID 




NO:33) 


NO:34) 



SECONDARY PCR AMPLIFICATION 
EX FORWARD PRIMER 
1 A CTCTGGCAGG AGCAAAG (SEQ ID 
NO:35) 

1 C GGTA ATTAAG ATGAAGAAAGCA(SEQ 
IDNO:37) 

ID GTGGCTCAATGACAAGG (SEQ ID 
NO:39) 

IE TTAAGACGAAGGAAACAATTCT(SEQ 
IDNO:4t) 

IF ATCAAAGGGTAAAATTCAGA (SEQ ID 
NO:43) 

1G TTTTGAGGGCAGGTTCTA (SEQ ID 
NO:45) 

IH TTCTCTCATGGCTCGCA (SEQ ID NO:47) 

1 J CCGCCTACTGTATC ATAGCA (SEQ ID 
NO:49) 



REVERSE PRIMER 

ATACACACCTGGGATAGTGG (SEQ ID 
NO:36) 

CTGAGATAGTGGCTTCCTG (SEQ ID NO:38) 

ATATGGAAGCACTTGTA AGTAAA(SEQ ID 
NO:40) 

ACCTGAGATAGTGGCTTCC (SEQ IDNO:42) 

GGCAGTCCAAAAGAAATA (SEQ ID NO:44) 

AATGGGACAAATGTAA ATGATA (SEQ ID 
NO:46) 

ATGTCAAATCACAATTCAGTAAGG (SEQ ID 
NO:48) 

CAACGAAATGTCAAATCACAG (SEQ ID 
NO:50) 
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Publicly available genomic sequences were used as references. Twenty-five 
nanograms of genomic DNA were amplified in the primary amplifications using the Perkin 
Elmer GeneAmp PCR kit according to the manufacturers instructions in 25 pi reactions with 
AmpliTaq Gold DNA polymerase. Reactions contained 25 mM MgCI2 and 0.2 pM of each 
5 primer. Thermal cycling was performed using a GeneAmp PCR System 9600 PCR machine 
(Perkin Elmer), utilizing a touch-down PCR protocol. The protocol, unless indicated otherwise 
in Table 3, consisted of an initial incubation of 95°C for 10 min, followed by eight cycles of 
95°C for 20 sec, 66°C (minus 1°C per cycle) for 15 sec, 72»C for 2 min, and twenty-seven 
cycles of 95°C for 20 sec, 54°C for 15 sec, 72°C for 2 min, and one final extension step of 
10 72°Cfor10min. 

For the secondary PCR reactions, one microliter of each primary PCR reaction was re- 
amplified using the secondary PCR primers, also listed in Table 2. The thermal cycling profile 
that was used for the primary PCR for an exon was used for the secondary PCR. 

1 5 Table 3. Cycling Profile Modifications 



Exon 


Primary PCR 


Secondary PCR 


IE 


Touch-Down PCR step: 8 cycles 

64 C (minus 1 C per cycle), for 15 sec 

Total Number of cycles: 35 


same as Primary PCR 


IF 


Touch-Down PCR step: 10 cycles 
64 C (minus I C per cycle), for 15 sec 
Total Number of cycles: 35 


same as Primary PCR 


IG 


Touch-Down PCR step: 7 cycles 

64 C (minus 1 C per cycle), for 15 sec 

Total Number of cycles: 35 


same as Primary PCR 


IH 


Touch-Down PCR step: 10 cycles 
66 C (minus 1 C per cycle), for 15 sec 
Total Number of cycles: 35 


same as Primary PCR 



30 DNA sequencing. PCR products from 48 Individuals, approximately one-third 

representing each of the 3 major racial groups (see above), were prepared for sequencing 
by treating 8 pL of each PCR product with 0.15 pL of exonuclease I (1.5 U/reaction), 0.3 pL 
of Shrimp Alkaline Phosphatase (0.3 U/reactjon), q.s. to 10 pL with MilliQ water, and 
incubated at 37°C for 15 min, followed by 72°C for 15 min. Cycle sequencing was 

35 performed on the GeneAmp PCR System 9600 PCR machine (Perkin Elmer) using the ABI 
Prism dRhodamine Terminator Cycle Sequencing Ready Reaction Kit according to the 
manufacturer's directions, with the following changes: (1) 2 pL of dRhodamine terminator 
premix, instead of 8 pL; and (2) 10% (v/v) Dimethylsutfoxide was added to each individual 
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nucleotide. The oligonucleotide primers (unlabeled), at 3 picomoles per reaction, used for 
the sequencing reactions are listed in Table A. Sequencing reactions, with a final volume of 
5 pL, were subjected to 30 cycles at 96°C for 20 sec, 50°C for 5 sec, and 60°C for 4 min, 
followed by ethanol precipitation. After decanting the ethanol, samples were evaporated to 

5 dryness using a SpeedVac for roughly 1 5 min and were resuspended in 2 \x\ of loading 
buffer (5:1 deionized formamide:50 mM EDTA pH 8.0). The samples were then, heated to 
94°C for 2 min, and electrophoresed through 5.25% polyacrylamide/6M urea gels in an ABI 
Prism 377 DNA Sequencer according to the manufacturer's instructions for sequence 
determination. All sequences were determined from both the 5' and 3' (sense and 

10 antisense) direction. 

Of the forty-eight samples, 38 polymorphisms were identified. The polymorphisms 

are described in Table 5 below. 
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Table 4. Sequencing Primers (No. = Polymorphism No.) 
No. FORWARD PRIMER 

CTCTGGCAGG AGCAAAG (SEQ ID NO:5 1) ACAGTGGGCAGAGACAG (SEQ ID 

NO:52) 

GTGGTTTATTCCCCGTAT (SEQ ID NO:53) ATACACACCTGGGATAGTGG (SEQ ID 



REVERSE PRIMER 



GGTAATTAAGATGAAGAAAGCA (SEQ 
IDNO:55) 



1 

2 
3-5 
6 

7,8 
9-12 



13,14 AGAATGGCAATTATGAACA (SEQ ID 
NO:63) 

10 15-17 AGAATGGCAATTATGAACA (SEQ ID 
NO:65) 

1 8-24 CTCTGGC X CTGTCCTAC* (SEQ ID 
NO:67) 

25 ATCAAAGGGTAAAATTCAGA (SEQ ID 
NO:69) 

26 AATTTGCTTTTGAAAGAATC (SEQ ID 
NO:71) 

27,28 AATTTGCTTTTGAAAGAATC (SEQ ID 
NO:73) 

1 5 29,30 TTTTGAGGGCAGGTTCTA (SEQ ID 
NO:75) 

3 1 ,32 TTGCAGGAGTTTGTTTAAT (SEQ ID 
NO:77) 

33 CATTGCAGGAGTTTGTTTA (SEQ ID 
NO:79) 

34 AGAAATAGCCTCTGAAATTC (SEQ ID 
NO:8I) 

35 CCGCCTACTGTATCATAGCA (SEQ ID 
NO:83) 

20 36-38 ATTTTGCCAGTATCTTTTTAG (SEQ ID 



NO:54) 

GAAATGGCATAGGTTGTC (SEQ ID 
NO:56) 



GGCCACACTCAACTGTA (SEQ ID NO:57) CTCAAAAAAAACACAGTAGG (SEQ ID 

NO:58) 

ACTTTTTCTGCCCCTTAT (SEQ ID NO:59) ATATGGAAGCACTTGTAAGTAAA (SEQ 

IDNO:60) 

TTAAGACGAAGGAAACAATTCT (SEQ ID AATGGCATACGTTGTCA (SEQ ID NO:62) 
NO:61) 

TGTGTGCCCTTAAAGTCT (SEQ ID 
NO:64) 

ACCTGAGATAGTGGCTTCC (SEQ ID 
NO:66) 

ACCTGAGATAGTGGCTTCC (SEQ ID 
NO:68) 

CAGCAGCTTGTCACCTAC (SEQ ID 
NO:70) 

GGTAGGCCCAAATACTCA (SEQ ID 
NO:72) 

GGCAGTCCAAAAG AAATA (SEQ ID 
NO:74) 

CACCTCTGGCATG ACTAC (SEQ ID 
NO:76) 

AATGGGACAAATGTAAATGATA (SEQ 
IDNO:78) 

CATCTGAGAACCCTAAGAGA (SEQ ID 
NO:80) 

ATGTCAAATCACAATTCAGTAAGG(SEQ 
IDNO:82) 

GAGTGTACGAGGTTGAGTAAG (SEQ ID 
NO:84) 

CAACGAAATGTCAAATCACAG (SEQ ID 
NO:86) 



NO:85) 

* Note polymorphism in primer. The reference sequence has a "C" at the highlighted 
position. 
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Table 5. UGT1 polymorphisms. Amino acid changes numbered from first methionine for 



that exon (Ex). 






No 


Ex 


NtO 


A A 

AA 


ct?fM Ttrxin? /ct?r\ xv\ vrfY\ 
obv^UbNUb {ot.Kl ID 1NU.J 


I 


1A 


G227 A 


uiy /I ATg 


a t>c* A^Ar.Ap a p.a r,r a tttta r* a /ttty^fo id 
CA I uAvjAvj AC A,uAutAl I 1 I ALAt^ 1 i^c-V uj 










NO:87) 


2 


1 A. 

1 A 




C^ r 7<1 pm 


GOAfTTATTG agp c fTGrATCTGTCT (SEO IDNO'88) 


J 


ir* 

1L 




Tm 11 A vrr 

I rp J 1 Arg 




4 


ii^ 






nr*rpTnn<"»PTfiA a AGTGGAAAGfSFOrDNO^O'l 


5 


1C 


1 154 C 


Vol 47 Ala 
Vol Hf Ala 


ATnrcznn a Crd c pttgpgggagct rsFO ID NO r 9H 


6 


1C 


A 521 G 




r , T/^T'/~ , ^ , /~ , /*~«/^ , ^ % ^ , ^tppt^^PT a a fCRn in MO-07^ 
L 1 C 1 uCOCCjUL- it.U i 1 OUU 1AAU i^bv ^ vi\J*y*) 


7 


ID 


G 848 A 




TACCCCAGGCC A.ATCATGCCCAACA (SEQ ID NO.93) 


8 


ID 


C43T 


Intronic 


TCCAGGCAAAA XACTTTTTAAAAAATG(SEQ ID 












9 


it? 
lb 


T 1 C7 r* 
1 15/ C 


Leu 4o oer 


a r:r* a Trmn.nn a o/^rrT Cleocin a fwn ID NOQVi 


i f\ 
10 


it? 
lb 




ASp jo OIU 


OCvjOOA ii. L 1 LLA J utuAUAUU wbl^ LU LHKJ.yv) 


11 


tc 
lb 


1 J.$JL U 


LBU 0 J iTO 


TnriTn^TrrTP a rrr na AfiaTf* A A (KFCi TF> TsJO'Q7^ 

l UVJ i Uu 1 ICAl-VvV- V. UUnuu 1 unft lis r j 


12 


lb 


A *>C7 n 
AZj/U 




1 AC A 1 LAAAuA UAUAAL 1 I 1 1 1 v^^-V lLJ 










NO:98) 


1^ 

i j 


IP 


C468 A 


His 142 Asn 


TGATCAGGCACCTG A ATGCTACTTCC (SEO ID NO:99) 


14 


IE 


C 517 G 


Ala ISft Glv 

/via uo vJiy 


APPTPTGrfi C GGPGGTGCTGG fSEO ID NO 100) 


i j 


1F 


C689T 




AAGAACATGCT T TACCCTCTGGC (SEO ID NO: 101) 


1 fi 


IE 


C 701 T 




CTCTGGC T CTGTCCTACC fSEO ID NO: 102) 

V> A V^ 1 VJ VJ Vp** V^ X VJ It V^Vv A. iix/v 4 *hFA^Vq * ~ x^ * * 


17 


IF 


C 717 T 




TCCTACCTTTGC T ATGCTGTTTCT (SEO ID NO: 103) 


1R 
1 o 


IE 


C 786 A 


1 en 248 He 


TGTCAGTGGTGGAT A TT TSEO ID NO: 104) 

A VJ A V/iVvJ ± VJ VJ A VJVJaV A * A ^ w*-*%^ * * ■ » w > ^ 


19 


IE 


G789C 


Val 249 Leu 


GGTGGAT A.TT C* TCAGC (SEQ ID NO: 105) 


20 


IE 


C795T 


His 251 Tyr 


TCAGC T ATGCATC (SEQ ID NO: 106) 


21 


IE 


T803 C 


Ser253Phe 


GCATC C GTGTGGCTGTTCCGA (SEQ ID NO:107) 


22 


IE 


G 819C 


Glv 259 Are 


TGGCTGTTCCGA CGGGACTT (SEQ ID NO: 108) 


23 


IE 


T827C 




GGGACTT C.GTGATGGA (SEQ ID NO: 109) 


24 


IE 


T836C 




GTGATGGA C TACCCCAGGCCGAT (SEQ ID NO: 1 1 0) 


25 


IF 


T 161 G 


Ser 7 Ala 

V* # * ft. ALA 


CCTGCCTCCTTCGC (LCATTTCAGAG (SEQ ID NO: 111) 


26 


IF 


A457G 




GCGATCATTCCT £ ACTGCTCCTCAG (SEQ ID N0:1 12) 


27 


IF 


A683G 


Thr 181 Ala 

A 1U A W A A 


CCCTGGAGCAT Q. C ATTC AGC AG (SEQ ID NO:l 13) 


28 


IF 


A694C 


Arc 184 Ser 


CATTCAGCAG C AGCCCAGACCCT (SEQ ID NO: 1 14) 


29 


1G 


T35 G 




TACTTCTTCCAC G TACTATATTA (SEQ ID NO: 1 15) 


30 


1G 


C 124 A 




GGCCTCCTTCG A.CTATATGTGTGT (SEQ ID NO:l 16) 


31 


1G 


T712C 


Trp 208 Arg 


GGAGAGAGTA Q.GGAACCACAT (SEQ ID N0:1 17) 


32 


1G 


G846A 




TCAATTTGGTT 4. TTGCG AACTG A (SEQ ID NO:l 18) 


33 


1H 


G518C 


Gly 173 Ala 


CAGGGGAATAG C.TTGCCACTAT (SEQ ID N0:119) 


34 


1H 


A765G 




TGTTGCGAAC Q, G ACTTTGTTTTGG (SEQ ID NO: 120) 


35 


1J 


G127A 




TTCACCAGCA &.TCGGTGGTGG (SEQ ID NO:121) 


36 


1J 


C694T 




CTAGAAATAGC XTCTCAAATTCTCC (SEQ ID NO: 122) 


37 


1J 


C 731 A 


Leu 244 lie 


CGGCATATGAT A.TCTACAGTCACA (SEQ ID NO: 123) 


38 


1J 


T761 C 


Arg 254 Stop 


TCAATTTGGTTG £ TGCGAACAGG AC (SEQ ID NO:124) 



The asterisk associated with the second nucleotide residue in polymorphism no. 19 



is in the sequence surrounding the newly discovered polymorphism at residue 789 
(nucleotide change from C at residue 786 to A). 
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All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be readily apparent 
to those of ordinary skill in the art in light of the teachings of this invention that certain 
changes and modifications may be made thereto without departing from the spirit or scope 
of the appended claims. 
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What is Claimed is: 
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1. An isolated nucleic acid molecule comprising a UGT1 sequence 
polymorphism of SEQ ID NOS: 87-124, as part of other than a naturally occurring 

5 chromosome. 

2. A nucleic acid probe for detection of UGT1 locus polymorphisms, comprising. - 
a polymorphic sequence of SEQ ID NOS:87-124. 

10 3. A nucleic acid probe according to Claim 2, wherein said probe is conjugated 

to a detectable marker. 

4. An array of oligonucleotides comprising: 

two or more probes for detection of UGT1 locus polymorphisms, said probes 
15 comprising at least one form of a polymorphic sequences of SEQ ID NOS: 87-1 24. 

5. A method for detecting in an individual a polymorphism in UGT1 metabolism 
of a substrate, the method comprising: 

analyzing the genome of said individual for the presence of at least one UGT1 
20 polymorphism of SEQ ID NOS:87-124; wherein the presence of said predisposing 
polymorphism is indicative of an alteration in UGT1 expression or activity. 

6. A method according to Claim 5, wherein said analyzing step comprises 
detection of specific binding between the genomic DNA of said individual with an array of 

25 oligonucleotides comprising: 

two or more probes for detection of UGT1 locus polymorphisms, said probes 
comprising at least one form of a polymorphic sequence of SEQ ID NOS:87-124. 

7. A method according to Claim 5, wherein said alteration in UGT1 expression 
30 is tissue specific. 

8. A method according to Claim 5, wherein said alteration in UGT1 expression 
is in response to a UGT1 modifier. 

35 9. A method according to Claim 8, wherein said modifier induces UGT1 

expression. 
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10. A method according to Claim 8, wherein said modifier inhibits UGT1 
expression. 



-23- 



WO 99/57322 



PCT/US99/09702 



1/3 



o 

OS 



o 



o 



oo 



II 



^ 1 1 

m -e ■§ K 

S — K °- 

™ -C >i 

ma© 



2 

«3 « 4> 

I M 



* 



o 

Li. 



4-- 



^ — Uj NO 

,2 ^ 

4.- ? 

«. w 

4.- = 



WO 99/57322 



PCT/US99/09702 



2/3 



oo .5 K 

s § 

«-t J= 

r- OQ a 



a- 




48- 



o 



3 5 

I- 5 
s a "8 

Egg 

CQ q. £ 




CM 
O 



to 

B s 



W H < - 



11—4.1 



cn r- — T_ 1 



WO 99/57322 



PCT/US99/09702 




WO 99/57322 



PCT/US99/09702 



SEQUENCE LISTING 

<110> Penny, Laura 

Galvin, Margaret 

<120> Genotyping the Human 

UDP-Glucuronosyltransf erase 1 (UGT1) Gene 



<130> SEQ-17WO 

<140> Unassigned 
<141> 1999-05-04 

<150> 60/084,807 
<151> 1998-05-07 

<160> 124 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 864 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (864) 

<400> 1 

atg get gtg gag tec cag ggc gga cgc cca ctt gtc ctg ggc ctg ctg '48 

Met Ala Val Glu Ser Gin Gly Gly Arg Pro Leu Val Leu Gly Leu Leu 
1 5 10 15 

ctg tgt gtg ctg ggc cca gtg gtg tec cat get ggg aag ata ctg ttg 96 
Leu Cys Val Leu Gly Pro Val Val Ser His Ala Gly Lys lie Leu Leu 
20 25 30 

ate cca gtg gat ggc age cac tgg ctg age atg ctt ggg gee ate cag 14 4 

He Pro Val Asp Gly Ser His Trp Leu Ser Met Leu Gly Ala He Gin 
35 40 45 

cag ctg cag cag agg gga cat gaa ata gtt gtc eta gca cct gac gee 192 
Gin Leu Gin Gin Arg Gly His Glu He Val Val Leu Ala Pro Asp Ala 
50 55 60 

teg ttg tac ate aga gac gga gca ttt tac acc ttg aag acg tac cct 240 
Ser Leu Tyr He Arg Asp Gly Ala Phe Tyr Thr Leu Lys Thr Tyr Pro 
65 70 75 80 

gtg cca ttc caa agg gag gat gtg aaa gag tct ttt gtt agt etc ggg 288 
Val Pro Phe Gin Arg Glu Asp Val Lys Glu Ser Phe Val Ser Leu Gly 
85 - 90 95 

cat aat gtt ttt gag aat gat tct ttc ctg cag cgt gtg ate aaa aca 336 
His Asn Val Phe Glu Asn Asp Ser Phe Leu Gin Arg Val He Lys Thr 
100 105 110 

tac aag aaa ata aaa aag gac tct get atg ctt ttg tct ggc tgt tec 384 
Tyr Lys Lys He Lys Lys Asp Ser Ala Met Leu Leu Ser Gly Cys Ser 
115 120 125 
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SEQUENCE LISTING 

<110> Penny, Laura 

Galvin, Margaret 

<120> Genotyping the Human 

UDP-Glucuronosyltransferase 1 (UGT1) Gene 



<130> SEQ-17WO 

<140> Unassigned 
<141> 1999-05-04 

<150> 60/084,807 
<151> 1998-05-07 

<160> 124 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 864 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> {!)... (864) 

<400> 1 

atg get gtg gag tec cag ggc gga cgc cca ctt gtc ctg ggc ctg ctg '48 

Met Ala Val Glu Ser Gin Gly Gly Arg Pro Leu Val Leu Gly Leu Leu 

1 5 10 15 

ctg tgt gtg ctg ggc cca gtg gtg tec cat get ggg aag ata ctg ttg 96 
. Leu Cys Val Leu Gly Pro Val Val Ser His Ala Gly Lys lie Leu Leu 
20 25 30 

ate cca gtg gat ggc age cac tgg ctg age atg ctt ggg gec ate cag 144 
lie Pro Val Asp Gly Ser His Trp Leu Ser Met Leu Gly Ala lie Gin 
35 40 45 

cag ctg cag cag agg gga cat gaa ata gtt gtc eta gca cct gac gee 192 
Gin Leu Gin Gin Arg Gly His Glu lie Val Val Leu Ala Pro Asp Ala 
50 55 60 

teg ttg tac ate aga gac gga gca ttt tac ace ttg aag acg tac cct 24 0 

Ser Leu Tyr He Arg Asp Gly Ala Phe Tyr Thr Leu Lys Thr Tyr Pro 
65 70 75 80 

gtg cca ttc caa agg gag gat gtg aaa gag tct ttt gtt agt etc ggg 288 
Val Pro Phe Gin Arg Glu Asp Val Lys Glu Ser Phe Val Ser Leu Gly 
85 . 90 95 

cat aat gtt ttt gag aat gat tct ttc ctg cag cgt gtg ate aaa aca 336 
His Asn Val Phe Glu Asn Asp Ser Phe Leu Gin Arg Val He Lys Thr 
100 105 HO 

tac aag aaa ata aaa aag gac tct get atg ctt ttg tct ggc tgt tec 384 
Tyr Lys Lys He Lys Lys Asp Ser Ala Met Leu Leu Ser Gly Cys Ser 
115 120 125 
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cac tta ctg cac aac aag gag etc atg gec tec ctg gca gaa age age 4 32 

His Leu Leu His Asn Lys Glu Leu Met Ala Ser Leu Ala Glu Ser Ser 
130 135 140 

ttt gat gtc atg ctg acg gac cct ttc ctt cct tgc age ccc ate gtg 4 80 

Phe Asp Val Met Leu Thr Asp Pro Phe Leu Pro Cys Ser Pro lie Val 
!45 150 155 160 

gee cag tac ctg tct ctg ccc act gta ttc ttc ttg cat gca ctg cca 528 
Ala Gin Tyr Leu Ser Leu Pro Thr Val Phe Phe Leu His Ala Leu Pro 
165 170 175 

tgc age ctg gaa ttt gag get ace cag tgc ccc aac cca ttc tec tac 576 
Cys Ser Leu Glu Phe Glu Ala Thr Gin Cys Pro Asn Pro Phe Ser Tyr 
180 185 190 

gtg ccc agg cct etc tec tct cat tea gat cac atg ace ttc ctg cag 624 
Val Pro Arg Pro Leu Ser Ser His Ser Asp His Met Thr Phe Leu Gin 
195 200 205 

egg gtg aag aac atg etc att gec ttt tea cag aac ttt ctg tgc gac 672 
Arg Val Lys Asn Met Leu lie Ala Phe Ser Gin Asn Phe Leu Cys Asp 
210 215 220 

gtg gtt tat tec ccg tat gca ace ctt gec tea gaa ttc ctt cag aga 720 
Val Val Tyr Ser Pro Tyr Ala Thr Leu Ala Ser Glu Phe Leu Gin Arg 
225 230 235 240 

gag gtg act gtc cag gac eta ttg age tct gca tct gtc tgg ctg ttt 768 
Glu Val Thr Val Gin Asp Leu Leu Ser Ser Ala Ser Val Trp Leu Phe 
245 250 255 

aga agt gac ttt gtg aag gat tac cct agg ccc ate atg ccc aat atg 816 
Arg Ser Asp Phe Val Lys Asp Tyr Pro Arg Pro lie Met Pro Asn Met 
260 265 270 

gtt ttt gtt ggt gga ate aac tgc ctt cac caa aat cca eta tec cag 864 
Val Phe Val Gly Gly He Asn Cys Leu His Gin Asn Pro Leu Ser Gin 
275 280 285 



<210> 2 

<211> 288 

<212> PRT 

<213> Homo sapiens 



<400> 2 



Met 


Ala 


Val 


Glu 


Ser Gin Gly Gly 


1 








5 


Leu Cys 


Val 


Leu 


Gly Pro Val Val 








20 




He 


Pro 


Val 


Asp 


Gly Ser His Trp 






35 




40 


Gin 


Leu 


Gin 


Gin 


Arg Gly His Glu 




50 






.55 


Ser 


Leu 


Tyr 


He 


Arg Asp Gly Ala 


65 








70 


Val 


Pro 


Phe 


Gin 


Arg Glu Asp Val 










85 


His 


Asn 


Val 


Phe 


Glu Asn Asp Ser 








100 


Tyr Lys 


Lys 


He 


Lys Lys Asp Ser 






115 




120 



Arg Pro Leu Val Leu Gly Leu Leu 

10 15 
Ser His Ala Gly Lys He Leu Leu 
25 30 
Leu Ser Met Leu Gly Ala He Gin 
45 

He Val Val Leu Ala Pro Asp Ala 
60 

Phe Tyr Thr Leu Lys Thr Tyr Pro 

75 80 
Lys Glu Ser Phe Val Ser Leu Gly 

90 95 
Phe Leu Gin Arg Val He Lys Thr 
105 no 
Ala Met Leu Leu Ser Gly Cys Ser 
125 
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rti/ua 


His 


Leu 
130 


Leu His 


Asn 


Lys 


Glu Leu 
135 


Met Ala 


Ser 


Leu 
140 


Ala 


Glu 


Ser 


Ser 


Phe 


Asp 


Val Met 


Leu 


Thr 


Asp Pro 


Phe Leu 


Pro Cys 


Ser 


Pro 


He 


va± 


145 






150 






155 












Ala 


Gin 


Tyr Leu 


Ser 


Leu 


Pro Thr 


Val Phe 


Phe 


Leu 


His 


Ala 


Leu 


Fro 






165 






170 










175 




Cys 


Ser 


Leu Glu 


Phe 


Glu 


Ala Thr Gin Cys 


Pro 


Asn 


Pro 


Phe 


Ser 


Tyr 




180 








185 








190 






Val 


Pro 


Arg Pro 


Leu 


Ser 


Ser His 


Ser Asp His Met 


Thr 


Phe 


Leu 


bin 






195 






200 








205 








Arg 


Val 


Lys Asn 


Met 


Leu 


He Ala 


Phe Ser 


Gin 


Asn 


Phe 


Leu Cys 


Asp 


210 








215 






220 










Val 


Val 


Tyr Ser 


Pro 


Tvr 
* y x 


Ala Thr 


Leu Ala 


Ser 


Glu 


Phe 


Leu 


Gin 


Arg 


225 






230 






235 










240 


Glu 


Val 


Thr Val 


Gin 


Asp 


Leu Leu 


Ser Ser 


Ala 


Ser 


Val 


Trp Leu 


Phe 








245 






250 










255 




Arg 


Ser 


Asp Phe 


Val 


Lys 


Asp Tyr Pro Arg Pro 


He 


Met 


Pro 


Asn 


Met 




260 








.265 








270 






Val 


Phe 


Val Gly 
275 


Gly 


lie 


Asn Cys 
280 


Leu His 


Gin 


Asn 


Pro 

285 


Leu 


Ser 


Gin 
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<210> 3 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (867) 

<400> 3 

atg gcc aca gga etc cag gtt ccc ctg ccg tgg ctg gec aca gga ctg 48 
Met Ala Thr Gly Leu Gin Val Pro Leu Pro Trp Leu Ala Thr Gly Leu 
15 10 15 

ctg ctt etc etc agt gtc cag ccc tgg get gag agt gga aag gtg ttg 96 
Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 
20 25 30 

gtg gtg ccc att gat ggc age cac tgg etc age atg egg gag gtc ttg 144 
Val Val Pro He Asp Gly Ser His Trp Leu Ser Met Arg Glu Val Leu 
35 40 45 

egg gag etc cat gcc aga ggc cac cag gca gtg gtc etc acc cca gag 192 
Arg Glu Leu His Ala Arg Gly His Gin Ala Val Val Leu Thr Pro Glu 
50 55 60 

gtg aat atg cac ate aaa gaa gag aac ttt ttc acc ctg aca acc tat 240 
Val Asn Met His He Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

gcc att teg tgg acc cag gat gaa ttt gat cgc cat gtg ctg ggc cac 288 
Ala He Ser Trp Thr Gin Asp Glu Phe Asp Arg His Val Leu Gly His 
85 90 95 

act caa ctg tac ttt gaa aca gaa cat ttt ctg aag aaa ttt ttc aga 336 
Thr Gin Leu Tyr Phe Glu Thr Glu His Phe Leu Lys Lys Phe Phe Arg 
100 105 110 

agt atg gca atg ttg aac aat atg tct ttg gtc tat cat agg tct tgt 384 
Ser Met Ala Met Leu Asn Asn Met Ser Leu Val Tyr His Arg Ser Cys 
115 120 125 
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gtg gag eta eta cat aat gag gec ctg ate agg cac ctg aat get act 432 

Val Glu Leu Leu His Asn Glu Ala Leu He Arg His Leu Asn Ala Thr 

130 135 140 

tec ttt gat gtg gtt tta aca gac ccc gtt aac etc tgc gcg gca gtg 480 

Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Ala Ala Val 

145 150 155 160 

ctg get aag tac ctg teg att cct act gtg ttt ttt ttg agg aac att 528 

Leu Ala Lys Tyr Leu Ser lie Pro Thr Val Phe Phe Leu Arg Asn He 
165 170 175 

cca tgt gat tta gac ttt aag ggc aca cag tgt cca aac cct tec tec 576 

Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 

180 185 190 

tat att cct aga tta eta aca ace aat tea gac cac atg aca ttc atg 624 

Tyr He Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Met 

195 200' 205 

caa agg gtc aag aac atg etc tac cct ctg gec ctg tec tac att tgc 672 

Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr He Cys 

210 215 220 

cat get ttt tct get cct tat gca age ctt gee tct gag ctt ttt cag 720 

His Ala Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 

225 230 235 240 

aga gag gtg tea gtg gtg gat att etc agt cat gca tct gtg tgg ctg 768 

Arg Glu Val Ser Val Val Asp He Leu Ser His Ala Ser Val Trp Leu 
245 250 255 

ttc cga ggg gac ttt gtg atg gac tac ccc agg cca ate atg ccc aac 816 

Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 

260 265 270 

atg gtc ttc att ggg ggc ate aac tgt gec aac agg aag cca eta tct 8 64 

Met Val Phe He Gly Gly He Asn Cys Ala Asn Arg Lys Pro Leu Ser 

275 280 285 

cag 867 
Gin 



<210> 4 
<211> 289 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Ala Thr Gly Leu Gin Val Pro Leu Pro Trp Leu Ala Thr Gly Leu 

15 10 15 

Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 

20 . 25 30 

Val Val Pro He Asp Gly Ser His Trp Leu Ser Met Arg Glu Val Leu 

35 40 45 

Arg Glu Leu His Ala Arg Gly His Gin Ala Val Val Leu Thr Pro Glu 

50 55 60 

Val Asn Met His He Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

Ala He Ser Trp Thr Gin Asp Glu Phe Asp Arg His Val Leu Gly His 
85 90 95 
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Thr 


Gin 


Leu 


Tvr Phe Glu Thr 


Glu 


Hi c 








100 






Ser 


Met 


Ala 


Met Leu Asn Asn 


Met 


Ser 






115 




120 




Val 


Glu 


Leu 


Leu His Asn CI n 


Ala 


.Leu 




130 




135 






Ser 


Phe 


Asp 


Val Val Leu Thr 


Asp 


Pro 


145 






150 




Leu 


Ala 


Lys 


Tvr Leu Ser Tip 


Pro 


Thr 








165 






Pro 


Cys 


Asp 


Leu Aso Phe Lvs 


Gly 


Thr 








180 




185 


Tvr 
y 


lie 


Pro 


Arg Leu Leu Thr 


Thr 


Asn 






195 








Gin 


Arg 


Val 


Lys Asn Met Leu 


Tvr 


Pro 




210 




^ X -J 






mo 




Phe 


Qpr Ala Prn Tur* 
OCX ni a t x u i yi 




Ser 


225 






230 






Arg 


Glu 


Val 


Ser Val Val Asp 


lie 


Leu 








245 






Phe 


Arg 


Gly 


Asp Phe Val Met 


Asp 


Tyr 








260 




265 


Met 


Val 


Phe 


lie Gly Gly lie 


Asn 


Cys 






275 




280 




Gin 
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irne 


Leu 


Lys 


Lys 


Pne 
110 


Pne 


Arg 


Leu 


Val 


Tyr 


His 
LZd 


Arg 


Ser 


Cys 


lie 


Arg 


His 

i in 

X 4U 


Leu 


Asn 


Ala 


Thr 


Val 


Asn 

ISO 


Leu 


Cys 


Ala 


Ala 


Val 
lou 


Val 


Phe 


Phe 


Leu 


Arg 


Asn 


He 


1 7fl 
I rU 










1 ID 




t»j.n 


Cys 


fro 


Asn 


fro 

J. 


Ser 


Ser 


Ser 


Asp 


His 


Met 


Thr 


Phe 


Met 


Leu 


Ala 


Leu 


Ser 


Ty r 


He 


Cys 


Leu 


Ala 
235 


Ser 


Glu 


Leu 


Phe 


Gin 
240 


Ser 


His 


Ala 


Ser 


Val 


Trp 


Leu 


250 










255 




Pro 


Arg 


Pro 


He 


Met 
270 


Pro 


Asn 


Ala 


Asn 


Arg 


Lys 
285 


Pro 


Leu 


Ser 



<210> 5 

<211> 867 

<212> DNA 

<213> Homo sapiens 





<220> 
<221> 
<222> 


CDS 

(1) . 


. (867) 


























<400> 


5 




























atg 
Met 
1 


gec 
Ala 


aga 
Arg 


gga 
Gly 


etc 
Leu 
5 


cag 
Gin 


gtt 
Val 


ccc 
Pro 


ctg 
Leu 


ccg 
Pro 
10 


egg 
Arg 


ctg 
Leu 


gee 
Ala 


aca 
Thr 


gga 
Gly 
15 


ctg 
Leu 


48 


ctg 
Leu 


etc 
Leu 


etc 
Leu 


etc 
Leu 
20 


agt 
Ser 


gtc 
Val 


cag 
Gin 


ccc 
Pro 


tgg 
Trp 
25 


get 
Ala 


gag 
Glu 


agt 
Ser 


gga 
Gly 


aag 
Lys 
30 


gtg 
Val 


ttg 
Leu 


96 


gtg 
Val 


gtg 
Val 


ccc 
Pro 
35 


act 
Thr 


gat 
Asp 


ggc 
Gly 


age 
Ser 


ccc 
Pro 
40 


tgg 
Trp 


etc 
Leu 


age 
Ser 


atg 
Met 


egg 
Arg 
45 


gag 
Glu 


gec 
Ala 


ttg 
Leu 


144 


egg 
Arg 


gag 
Glu 
50 


etc 
Leu 


cat 
His 


gee 
Ala 


aga 
Arg 


ggc 
Gly 
55 


cac 
His 


cag 
Gin 


gcg 
Ala 


gtg 
Val 


gtc 
Val 
60 


etc 
Leu 


acc 
Thr 


cca 
Pro 


gag 
Glu 


192 


gtg 
Val 
65 


aat 
Asn 


atg 
Met 


cac 
His 


ate 
He 


aaa 
Lys. 
70 


gaa 
Glu 


gag 
Glu 


aaa 
Lys 


ttt 
Phe 


ttc 
Phe 
75 


acc 
Thr 


ctg 
Leu 


aca 
Thr 


gec 
Ala 


tat 
Tyr 
80 


240 


get 
Ala 


gtt 
Val 


cca 
Pro 


tgg 
Trp 


acc 
Thr 
85 


cag aag 
Gin Lys 


gaa 
Glu 


ttt 
Phe 


gat 
Asp 
90 


cgc 
Arg 


gtt 
Val 


acg 
Thr 


ctg 
Leu 


ggc 
Gly 
95 


tac 
Tyr 


288 


act 
Thr 


caa 
Gin 


ggg 

Gly 


ttc 
Phe 
100 


ttt 
Phe 


gaa 
Glu 


aca 
Thr 


gaa 
Glu 


cat 
His 
105 


ctt 
Leu 


ctg 
Leu 


aag 
Lys 


aga 
Arg 


tat 
Tyr 
110 


tct 
Ser 


aga 
Arg 


336 
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agt atg gca att atg aac aat gta tct ttg gcc ctt cat agg tgt tgt 384 
Ser Met Ala He Met Asn Asn Val' Ser Leu Ala Leu His Arg Cys Cys 
115 120 125 

gtg gag eta ctg cat aat gag gcc ctg ate agg cac ctg aat get act 432 
Val Glu Leu Leu His Asn Glu Ala Leu He Arg His Leu Asn Ala Thr 
130 135 140 

tec ttt gat gtg gtt tta aca gac ccc gtt aac etc tgt ggg gcg gtg 480 
Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Gly Ala Val 
145 150 155 160 

ctg get aag tac ctg teg att cct get gtg ttt ttt tgg agg tac att 528 
Leu Ala Lys Tyr Leu Ser He Pro Ala Val Phe Phe Trp Arg Tyr He 
165 170 175 

cca tgt gac tta gac ttt aag ggc aca cag tgt cca aat cct tec tec 576 
Pro Cys Asp Leu Asp Phe Lys Gly ' Thr Gin Cys Pro Asn Pro Ser Ser 
180 185 190 

tat att cct aag tta eta acg ace aat tea gac cac atg aca ttc ctg 624 
Tyr He Pro Lys Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu 
195 200 205 

caa agg gtc aag aac atg etc tac cct ctg gcc ctg tec tac att tgc 672 
Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr He Cys 
210 215 220 

cat act ttt tct gcc cct tat gca agt ctt gcc tct gag ctt ttt bag 720 
His Thr Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

aga gag gtg tea gtg gtg gat ctt gtc age tat gca tec gtg tgg ctg 768 
Arg Glu Val Ser Val Val Asp Leu Val Ser Tyr Ala Ser Val Trp Leu 
245 250 255 

ttc cga ggg gac ttt gtg atg gac tac ccc agg ccg ate atg ccc aac 816 
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 
260 265 270 

atg gtc ttc att ggg ggc ate aac tgt gcc aac ggg aag cca eta tct 864 
Met Val Phe He Gly Gly He Asn Cys Ala Asn Gly Lys Pro Leu Ser 
275 280 285 

cag . 867 

Gin 



<210> 6 

<211> 289 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Ala Arg Gly Leu Gin Val Pro Leu Pro Arg Leu Ala Thr Gly Leu 

1 5 io 15 

Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 

20 25 30 

Val Val Pro Thr Asp Gly Ser Pro Trp Leu Ser Met Arg Glu Ala Leu 

35 40 45 

Arg Glu Leu His Ala Arg Gly His Gin Ala Val Val Leu Thr Pro Glu 
50 55 60 
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Val Asn Met His lie Lys Glu Glu Lys Phe Phe Thr Leu Thr Ala Tyr 
65 70 75 80 

Ala Val Pro Trp Thr Gin Lys Glu Phe Asp Arg Val Thr Leu Gly Tyr 

85 90 95 

Thr Gin Gly Phe Phe Glu Thr Glu His Leu Leu Lys Arg Tyr Ser Arg 

100 105 110 

Ser Met Ala He Met Asn Asn Val Ser Leu Ala Leu His Arg Cys Cys 

115 120 125 

Val Glu Leu Leu His Asn Glu Ala Leu He tog His Leu Asn Ala Thr 

130 135 140 

Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Gly Ala Val 
145 150 155 160 

Leu Ala Lys Tyr Leu Ser He Pro Ala Val Phe Phe Trp Arg Tyr He 

165 170 175 

Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 

180 185 190 

Tyr He Pro Lys Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu 

195 200 . 205 

Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr He Cys 

210 215 220 

His Thr Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

Arg Glu Val Ser Val Val Asp Leu Val Ser Tyr Ala Ser Val Trp Leu 

245 250 255 

Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 

260 265 270 

Met Val Phe He Gly Gly He Asn Cys Ala Asn Gly Lys Pro Leu Ser 
275 280 285 

Gin 



<210> 7 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . (867) 

<400> 7 

atg gcc aca gga etc cag gtt ccc ctg ccg cag ctg gec aca gga ctg 

Met Ala Thr Gly Leu Gin Val Pro Leu Pro Gin Leu Ala Thr Gly Leu 

15 10 15 

ctg ctt etc etc agt gtc cag ccc tgg get gag agt ggg aag gtg ctg 
Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 
20 25 30 



egg gac etc cat gcg aga ggc cac cag gtg gtg gtc etc acc ctg gag 
Arg Asp Leu His Ala Arg. Gly His Gin Val Val Val Leu Thr Leu Glu 
50 55 60 

gtg aat atg tac ate aaa gaa gag aac ttt ttc acc ctg aca acg tat 
Val Asn Met Tyr He Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

gcc att tea tgg acc cag gac gaa ttt gat cgc ctt ttg ctg ggt cac 
Ala He Ser Trp Thr Gin Asp Glu Phe Asp Arg Leu Leu Leu Gly His 
85 90 95 



48 



96 



gtg gtg ccc act gat ggc age cac tgg etc age atg egg gag gcc ttg 144 
Val Val Pro Thr Asp Gly Ser His Trp Leu Ser Met Arg Glu Ala Leu 
35 40 45 



192 



240 



288 
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act caa teg ttc ttt gaa aca gaa cat ctt ctg atg aaa ttt tct aga 336 
Thr Gin Ser Phe Phe Glu Thr Glu His Leu Leu Met Lys Phe Ser Arg 
100 105 110 

aga atg gca att atg aac aat atg tct ttg ate ata cat agg tct tgt 384 
Arg Met Ala lie Met Asn Asn Met Ser Leu lie lie His Arg Ser Cys 
115 120 125 

gtg gag eta ctg cat aat gag gec ctg ate agg cac ctg cat get act 432 
Val Glu Leu Leu His Asn Glu Ala Leu lie Arg His Leu His Ala. Thr 
130 135 140 

tec ttt gat gtg gtt eta aca gac ccc ttt cac etc tgc gcg gcg gtg 480 
Ser Phe Asp Val Val Leu Thr Asp Pro Phe His Leu Cys Ala Ala Val 
145 150 155 160 

ctg get aag tac ctg teg att cct get gtg ttt ttc ttg agg aac att 528 
Leu Ala Lys Tyr Leu Ser He Pro Ala Val Phe Phe Leu Arg Asn He 
165 170 175 

cca tgt gat tta gac ttt aag ggc aca cag tgt cca aac cct tec tec 576 
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 
180 185 190 

tat att cct aga tta eta acg acc aat tea gac cac atg aca ttc ctg 624 
Tyr He Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu 
195 200 205 

caa agg gtc aag aac atg etc tac cct ctg gec ctg tec tac ctt tgc 672 
Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Leu Cys 
210 215 220 

cat get gtt tct get cct tat gca age ctt gee tct gag ctt ttt cag 720 
His Ala Val Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

aga gag gtg tea gtg gtg gat ctt gtc age cat gca tct gtg tgg ctg 768 
Arg Glu Val Ser Val Val Asp Leu Val Ser His Ala Ser Val Trp Leu 
245 250 255 

ttc cga ggg gac ttt gtg atg gat tac ccc agg ccg ate atg ccc aac 816 
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 
260 265 270 

atg gtc ttc att ggg ggc ate aac tgt gee aac ggg aag cca eta tct 864 
Met Val Phe He Gly Gly He Asn Cys Ala Asn Gly Lys Pro Leu Ser 
275 280 285 

cag 867 
Gin 



<210> 8 

<211> 289 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Ala Thr Gly Leu Gin Val Pro 

1 5 
Leu Leu Leu Leu Ser Val Gin Pro 
20 



Leu Pro Gin Leu Ala Thr Gly Leu 

10 15 
Trp Ala Glu Ser Gly Lys Val Leu 
25 30 
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Val 


Val 


Pro 


Thr 


Asp Gly Ser 


His 






35 










40 


Arg 


Asp 


Leu 


His Ala Arg Gly 


His 




50 










55 




Val 


Asn 


Met 


xyi. 


He 


Lys 


Glu 


Glu 


65 










70 






Ala 


He 


Ser 


Trp 


Thr 


Gin Asp 


Glu 










85 








Thr 


Gin 


Ser 


Phe 


Phe 


Glu 


Thr 


Glu 








100 










Arq 


Met 


Ala 


He 


Met 


Asn 


Asn 


Met 






115 










120 


Val 


Glu 


Leu 


Leu 


His 


Asn 


Glu 


Ala 




130 










135 




Ser 


Phe 


Asp 


Val 


Val 


Leu 


Thr 


Asp 


145 










150 




Leu 


Ala 


Lvs 


Tyr Leu 


Ser 


He 


Pro 










165 








Pro 


Cvs 


Asp 


Leu Asp 


Phe 


Lys 


Glv 








180 










Tvx 


He 


Pro 


Arg 


Leu 


Leu 


Thr 


Thr 






195 










200 


Gin 


Ara 


Val 


Lys 


Asn 


Met 


Leu 


Tvr 




210 










215 




His 


Ala 


Val 


Ser 


Ala 


Pro 


Tyr 


Ala 


225 










230 






Arg 


Glu 


Val 


Ser 


Val 


Val 


Asp 


Leu 










245 








Phe 


Arg 


Gly 


Asp 


Phe 


Val 


Met 


Asp 








260 










Met 


Val 


Phe 


He 


Gly Gly He 


Asn 






275 










280 


Gin 

















1 .LJJ 


Leu 


Ser 


Met 


Arg 


Glu 


Ala 


ucu 


















Gin 


Val 


Val 


Val 


Leu 


Thr 


Leu 


Glu 








60 










noil 


IT lie 


true 


Thr 
i ill. 


L*eu 


Thr 


Thr 


Tyr 






/ -j 












Phe 




Arg 


Jjeu 


.Leu 


- 

Leu 






















His 


Leu 


1>6U 




- 

j-»ys 


Ph P 




Arg 


105 










110 






Ser 


Leu 


He 


He 


His 


Arg 


Cay 
OCX. 












125 








Leu 


He 


Arg 


His 


Leu 


His 


Ala 


Thr 








140 










Pro 


Phe 


His 


Leu 


Cys 


Ala 


Ala 


Val 






155 










160 


Ala 


Val 


Phe 


Phe 


Leu 


Arg 


Asn 


He 




170 










175 




Thr 


Gin 


Cys 


Pro 


As 




OCi. 


Ser 


185 










190 






Asn 


Ser 


Asp 


His 


Met 


Thr 


Phe 


Leu 










205 








Pro 


Leu 


Ala 


Leu 


Ser 


Tvr 


Leu 


Cvs 








220 










Ser 


Leu 


Ala 


Ser 


Glu 


Leu 


Phe 


Gin 






235 










240 


Val 


Ser 


His 


Ala 


Ser 


Val 


Trp 


Leu 




250 










255 




Tyr 


Pro 


Arg 


Pro 


He 


Met 


Pro 


Asn 


265 










270 






Cys 


Ala 


Asn 


Gly 


Lys 


Pro 


Leu 


Ser 



285 



<210> 9 

<211> 861 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (861) 

<400> 9 

atg gec tgc etc ctt cgc tea ttt cag aga att tct gca ggg gtt ttc 48 

Met Ala Cys Leu Leu Arg Ser Phe Gin Arg He Ser Ala Gly Val Phe 
15 10 15 

ttc tta gca ctt tgg ggc atg gtt gta ggt gac aag ctg ctg gtg gtc 96 
Phe Leu Ala Leu Trp Gly Met Val Val Gly Asp Lys Leu Leu Val Val 
20 25 30 

cct cag gac gga age cac tgg ctt agt atg aag gat ata gtt gag gtt 14 4 

Pro Gin Asp Gly Ser His Trp Leu Ser Met Lys Asp He Val Glu Val 
35 40 45 

etc agt gac egg ggt cat gag att gta gtg gtg gtg cct gaa gtt aat 192 
Leu Ser Asp Arg Gly His Glu He Val Val Val Val Pro Glu Val Asn 
50 55 60 

ttg ctt ttg aaa gaa tec aaa tac tac aca aga aaa ate tat cca gtg 240 
Leu Leu Leu Lys Glu Ser Lys Tyr Tyr Thr Arg Lys He Tyr Pro Val 
65 70 75 80 
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ccg tat gac caa gaa gag ctg aag aac cgt tac caa tea ttt gga aac 288 
Pro Tyr Asp Gin Glu Glu Leu Lys Asn Arg Tyr Gin Ser Phe Gly Asn 
85 90 95 

aat cac ttt get gag cga tea ttc eta act get cct cag aca gag tac 336 
Asn His Phe Ala Glu Arg Ser Phe Leu Thr Ala Pro Gin Thr Glu Tyr 
100 105 HO 

agg aat aac atg att gtt att ggc ctg tac ttc ate aac tgc cag age 384 
Arg Asn Asn Met lie Val lie Gly Leu Tyr Phe He Asn Cys Gin Ser 
115 120 125 

etc ctg cag gac agg gac acc ctg aac ttc ttt aag gag age aag ttt 432 
Leu Leu Gin Asp Arg Asp Thr Leu Asn Phe Phe Lys Glu Ser Lys Phe 
130 135 140 

gat get ctt ttc aca gac cca gectta ccc tgt ggg gtg ate ctg get 480 
Asp Ala Leu Phe Thr Asp Pro Ala Leu Pro Cys Gly Val He Leu Ala 
145 150 155 160 

gag tat ttg ggc eta cca tct gtg tac etc ttc agg ggt ttt ccg tgt 528 
Glu Tyr Leu Gly Leu Pro Ser Val Tyr Leu Phe Arg Gly Phe Pro Cys 
165 170 175 

tec ctg gag cat aca ttc age aga age cca gac cct gtg tec tac att 576 
Ser Leu Glu His Thr Phe Ser Arg Ser Pro Asp Pro Val Ser Tyr He 
180 185 190 

ccc agg tgc tac aca aag ttt tea gac cac atg act ttt tec caa cga 624 
Pro Arg Cys Tyr Thr Lys Phe Ser Asp His Met Thr Phe Ser Gin Arg 
195 200 205 

gtg gee aac ttc ctt gtt aat ttg ttg gag ccc tat eta ttt tat tgt 672 
Val Ala Asn Phe Leu Val Asn Leu Leu Glu Pro Tyr Leu Phe Tyr Cys 
210 215 220 

ctg ttt tea aag tat gaa gaa etc gca tea get gtc etc aag aga gat 720 
Leu Phe Ser Lys Tyr Glu Glu Leu Ala Ser Ala Val Leu Lys Arg Asp 
225 230 235 240 

gtg gat ata ate acc tta tat cag aag gtc tct gtt tgg ctg tta aga 768 
Val Asp He He Thr Leu Tyr Gin Lys Val Ser Val Trp Leu Leu Arg 
245 250 255 

tat gac ttt gtg ctt gaa tat cct agg ccg gtc atg ccc aac atg gtc 816 
Tyr Asp Phe Val Leu Glu Tyr Pro Arg Pro Val Met Pro Asn Met Val 
260 265 270 

ttc att gga ggt ate aac tgt aag aag agg aaa gac ttg tct cag 861 
Phe lie Gly Gly He Asn Cys Lys Lys Arg Lys Asp Leu Ser Gin 
275 280 285 





<210> 


10 




<211> 


287 




<212> 


PRT 




<213> 


Homo 




<400> 


10 


Met 


Ala Cys 


Leu ] 


1 






Phe 


Leu Ala 


Leu ' 



sapiens 

>eu Arg Ser Phe Gin Arg He Ser Ala Gly VaJ 
$ 10 15 

Trp Gly Met Val Val Gly Asp Lys Leu Leu VaJ 
20 25 30 
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Pro Gin Asp Gly Ser His Trp Leu Ser Met Lys Asp lie Val Glu Val 

35 40 45 

Leu Ser Asp Arg Gly His Glu lie Val Val Val Val Pro Glu Val Asn 

50 55 60 

Leu Leu Leu Lys Glu Ser Lys Tyr Tyr Thr Arg Lys lie Tyr Pro Val 
65 70 75 80 

Pro Tyr Asp Gin Glu Glu Leu Lys Asn Arg Tyr Gin Ser Phe Gly Asn 

85 90 95 

Asn His Phe Ala Glu Arg Ser Phe Leu Thr Ala Pro Gin Thr Glu Tyr 

100 105 HO 

Arg Asn Asn Met He Val He Gly Leu Tyr Phe He Asn Cys Gin Ser 

115 120 125' 

Leu Leu Gin Asp Arg Asp Thr Leu Asn Phe Phe Lys Glu Ser Lys Phe 

130 135 140 

Asp Ala Leu Phe Thr Asp Pro Ala Leu Pro Cys Gly Val He Leu Ala 
145 150 155 160 

Glu Tyr Leu Gly Leu Pro Ser Val Tyr Leu Phe Arg Gly Phe Pro Cys 

165 170 175 

Ser Leu Glu His Thr Phe Ser Arg Ser Pro Asp Pro Val Ser Tyr He 

180 185 190 

Pro Arg Cys Tyr Thr Lys Phe Ser Asp His Met Thr Phe Ser Gin Arg 

195 200 205 

Val Ala Asn Phe Leu Val Asn Leu Leu Glu Pro Tyr Leu Phe Tyr Cys 

210 215 220 

Leu Phe Ser Lys Tyr Glu Glu Leu Ala Ser Ala Val Leu Lys Arg Asp 
225 230 235 240 

Val Asp He He Thr Leu Tyr Gin Lys Val Ser Val Trp Leu Leu Arg 

245 250 255 

Tyr Asp Phe Val Leu Glu Tyr Pro Arg Pro Val Met Pro Asn Met Val 

260 265 270 

Phe lie Gly Gly He Asn Cys Lys Lys Arg Lys Asp Leu Ser Gin 
275 280 285 



<210> 11 

<211> 951 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (951) 



<400> 11 

atg get cgt gca ggg tgg act ggc etc ctt ccc eta tat gtg tgt eta 
Met Ala Arg Ala Gly Trp Thr Gly Leu Leu Pro Leu Tyr Val Cys Leu 
15 10 15 

ctg ctg ace tgt get ttg cca agg tea ggg aag ctg ctg gta gtg ccc 
Leu Leu Thr Cys Ala Leu Pro Arg Ser Gly Lys Leu Leu Val Val Pro 
20 25 30 

atg gat ggg age cac tgg ttc ace atg cag teg gtg gtg gag aaa etc 
Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 
35 ' 40 45 

ate etc agg ggg cat gag gtg gtc gta gtc atg cca gag gtg agt tgg 
He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 
50 55 60 

caa ctg gga aga tea ctg aat tgc aca gtg aag act tac tea acc tea 
Gin Leu Gly Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 70 75 80 



48 



96 



144 



192 



240 
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tac act ctg gag gat cag gac egg gag ttc atg gtt ttt gec gat get 288 

Tvr Thr Leu Glu Asp Gin Asp Arg Glu Phe Met Val Phe Ala Asp Ala 

3 oc on 



85 90 95 

cgc tgg acg gca cca ttg cga agt gca ttt tct eta tta aca agt tea 
Arg Trp Thr Ala Pro Leu Arg Ser Ala Phe Ser Leu Leu Thr Ser Ser 
100 105 HO 

tec aat ggt att ttt gac tta ttt ttt tea aat tgc agg agt ttg ttt 
Ser Asn Gly He Phe Asp Leu Phe Phe Ser Asn Cys Arg Ser Leu Phe 
115 120 125 

aat gac cga aaa tta gta gaa tac tta aag gag agt tgt ttt gat gca 
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Cys Phe Asp Ala 
130 135 140 

gtg ttt etc gat cct ttt gat cgc tgt ggc tta att gtt gec aaa tat 
Val Phe Leu Asp Pro Phe Asp Arg Cys Gly Leu He Val Ala Lys Tyr 
145 150 155 160 

ttc tec etc ccc tct gtg gtc ttc gec agg gga ata ttt tgc cac tat 
Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Phe Cys His Tyr 
165 170 175 

ctt gaa gaa ggt gca cag tgc cct get cct ctt tec tat gtc ccc aga 
Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 
180 185 190 

ctt etc tta ggg ttc tea gac gee atg act ttc aag gag aga gta tgg 
Leu Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp 
195 200 205 

aac cac ate atg cac ttg gag gaa cat tta ttt tgc ccc tat ttt ttc 
Asn His He Met His Leu Glu Glu His Leu Phe Cys Pro Tyr Phe Phe 
210 215 220 

aaa aat gtc tta gaa ata gec tct gaa att etc caa acc cct gtc acg 
Lys Asn Val Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
225 230 235 240 

gca tat gat etc tac age cac aca tea att tgg ttg ttg cga act gac 
Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 
245 250 255 

ttt gtt ttg gag tat ccc aaa ccc gtg atg ccc aat atg ate ttc att 
Phe Val Leu Glu Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 
260 265 270 

ggt ggt ate aac tgt cat cag gga aag cca gtg cct atg gta agt tat 
Gly Gly He Asn Cys His Gin Gly Lys Pro Val Pro Met Val Ser Tyr 
275 280 285 

etc ccc ttt age aca tta aga ata ate tgg ctt tgg aaa tta aaa gat 
Leu Pro Phe Ser Thr Leu Arg He He Trp Leu Trp Lys Leu Lys Asp 
290 .295 300 

ttc tta cag aat cat aat tta tea ttt aca ttt gtc cca 
Phe Leu Gin Asn His Asn Leu Ser Phe Thr Phe Val Pro 
305 310 315 



336 



384 



432 



480 



528 



57 6 



624 



672 



720 



7 68 



816 



864 



912 



951 



<210> 12 
<211> 317 
<212> PRT 
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<213> Homo sapiens 
<400> 12 

Met Ala Arg Ala Gly Trp Thr Gly Leu Leu Pro Leu Tyr Val Cys Leu 

1 5 10 15 

Leu Leu Thr Cys Ala Leu Pro Arg Ser Gly Lys Leu Leu Val Val Pro 

20 25 30 

Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 

35 40 45 

He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 

50 55 60 

Gin Leu Gly Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 70 ?5 80 

Tyr Thr Leu Glu Asp Gin Asp Arg Glu Phe Met Val Phe Ala Asp Ala 

85 90 95 

Arg Trp Thr Ala Pro Leu Arg Ser Ala Phe Ser Leu Leu Thr Ser Ser 

100 105 HO 

Ser Asn Gly He Phe Asp Leu Phe Phe Ser Asn Cys Arg Ser Leu Phe 

115 120 125 

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Cys Phe Asp Ala 

130 135 140 

Val Phe Leu Asp Pro Phe Asp Arg Cys Gly Leu He Val Ala Lys Tyr 
X45 150 155 160 

Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Phe Cys His Tyr 

165 170 175 

Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 

180 185 190 

Leu Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp 

195 200 205 

Asn His He Met His Leu Glu Glu His Leu Phe Cys Pro Tyr Phe Phe 

210 215 220 

Lys Asn Val Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
225 230 235 240 

Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 

245 250 255 

Phe Val Leu Glu Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 

260 265 270 

Gly Gly He Asn Cys His Gin Gly Lys Pro Val Pro Met Val Ser Tyr 

275 280 285 

Leu Pro Phe Ser Thr Leu Arg He He Trp Leu Trp Lys Leu Lys Asp 

290 295 . 300 

Phe Leu Gin Asn His Asn Leu Ser Phe Thr Phe Val Pro 
305 310 315 



<210> 13 

<211> 930 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(930) 

<400> 13 

atg get cgc aca ggg tgg. acc age 

Met Ala Arg Thr Gly Trp Thr Ser 
1 5 

ctg ctg acc tgt ggc ttt get gag 
Leu Leu Thr Cys Gly Phe Ala Glu 
20 



ccc att ccc eta tgt gtt tct ctg 
Pro He Pro Leu Cys Val Ser Leu 
10 15 

gca ggg aag ctg ctg gta gtg ccc 
Ala Gly Lys Leu Leu Val Val Pro 
25 30 
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atg gat ggg agt cac tgg ttc acc atg cag teg gtg gtg gag aaa ctt 



PCT/US99/09702 

144 



Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 
35 40 45 

ate etc agg ggg cat gag gtg gtt gta gtc atg cca gag gtg agt tgg 
He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 
50 55 60 

caa ctg gga aaa tea ctg aat tgc aca gtg aag act tac tea acc tea 
Gin Leu Gly Lys Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 70 75 80 

tac act ctg gag gat ctg gac egg gaa ttc atg gat ttc gec gat get 
Tyr Thr Leu Glu Asp Leu Asp Arg Glu Phe Met Asp Phe Ala Asp Ala 
85 90 95 

caa tgg aaa gca caa gta cga agt ttg ttt tct eta. ttt ctg agt tea 
Gin Trp Lys Ala Gin Val Arg Ser. Leu Phe Ser Leu Phe Leu Ser Ser 
100 105 110 

tec aat ggt ttt ttt aac tta ttt ttt teg cat tgc agg agt ttg ttt 
Ser Asn Gly Phe Phe Asn Leu Phe Phe Ser His Cys Arg Ser Leu Phe 
115 120 125 

aat gac cga aaa tta gta gaa tac tta aag gag agt tct ttt gat gcg 
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala 
130 135 140 

gtg ttt ctt gat cct ttt gat gec tgt gcg tta att gtt gec aaa tat 
Val Phe Leu Asp Pro Phe Asp Ala Cys Ala Leu He Val Ala Lys Tyr 
145 150 155 160 

ttc tec etc ccc tct gtg gtc ttc gee agg gga ata ggt tgc cac tat 
Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Gly Cys His Tyr 
165 170 175 

ctt gaa gaa ggt gca cag tgc cct get cct ctt tec tat gtc ccc aga 
Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 
180 185 190 

att etc tta ggg ttc tea gat gec atg act ttc aag gag aga gta egg 
He Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Arg 
195 200 205 

aac cac ate atg cac ttg gag gaa cat tta ttt tgc cag tat ttt tec 
Asn His He Met His Leu Glu Glu His Leu Phe Cys Gin Tyr Phe Ser 
210 215 220 

aaa aat gec eta gaa ata gec tct gaa att etc caa aca cct gtc aca 
Lys Asn Ala Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
225 230 235 240 

gca tat gat etc tac age cac aca tea att tgg ttg ttg cga aca gac 
Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 
245 250 255 

ttt gtt ttg gac tat ccc aaa ccc gtg atg ccc aat atg ate ttc att 
Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 
260 265 270 

ggt ggt ate aac tgc cat cag gga aag cca ttg cct atg gta agt cac 
Gly Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met Val Ser His 
275 280 285 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 
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etc tec ttt age aca tta gga ata ate ttg get ttg gaa att aaa aaa 9 
Leu Ser Phe Ser Thr Leu Gly He lie Leu Ala Leu Glu He Lys Lys 
290 295 300 

9 

aga ttc ctt act gaa ttg 
Arg Phe Leu Thr Glu Leu 
305 310 



<210> 14 
<211> 310 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Met Ala Arg Thr Gly Trp Thr Ser Pro He Pro Leu Cys Val Ser Leu 

1 5 10 15 

Leu Leu Thr Cys Gly Phe Ala Glu Ala Gly Lys Leu Leu Val Val Pro 

20 25 30 

Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 

35 40 45 

He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 

50 55 60 

Gin Leu Gly Lys Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 70 75 8 ? 

Tvr Thr Leu Glu Asp Leu Asp Arg Glu Phe Met Asp Phe Ala Asp Ala 

* 85 90 95 

Gin Trp Lys Ala Gin Val Arg Ser Leu Phe Ser Leu Phe Leu Ser Ser 

100 105 HO 

Ser Asn Gly Phe Phe Asn Leu Phe Phe Ser His Cys Arg Ser Leu Phe 

115 120 125 

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala 

130 135 140 

Val Phe Leu Asp Pro Phe Asp Ala Cys Ala Leu He Val Ala Lys Tyr 
14 5 150 155 1^° 

Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Gly Cys His Tyr 

165 170 I 75 

Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 

180 185 190 

He Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Arg 

195 200 205 

Asn His He Met His Leu Glu Glu His Leu Phe Cys Gin Tyr Phe Ser 

210 215 220 

Lys Asn Ala Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
225 230 235 240 

Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 

245 250 255 

Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 

260 265 270 

Glv Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met Val Ser His 

275 280 285 

Leu Ser Phe Ser Thr Leu Gly He He Leu Ala Leu Glu He Lys Lys 

290 295 300 

Arg Phe Leu Thr Glu Leu 
305 310- 

<210> 15 

<211> 759 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (759) 
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ata qat 4 ggg lit cac tgg ttc acc atg cag teg gtg gtg gag aaa ctt 48 
Me? Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 
I 5 10 13 

ate etc agg ggg cat gag gtg gtt gta gtc atg cca gag gtg agt tgg 
lie Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 
20 25 30 



caa ctg gaa aga tea ctg aat tgc aca gtg aag act tac tea acc teg 
Leu Glu Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 



35 



40 *5 



tac act ctg gaa gat cag aac egg gaa ttc atg gtt ttc gec cat get 
Tyr Thr III Glu Asp Gin Asn Arg Glu Phe Met Val Phe Ala His Ala 
* 55 60 



50 



caa tgg aaa gca cag gca caa agt- ata ttt tct eta tta atg agt tea 
Gin Trp Lys Ala Gin Ala Gin Ser He Phe Ser Leu Leu Met Ser Ser 



65 



70 75 



tec aqt got ttt ctt gac tta ttt ttt teg cat tgc agg agt ttg ttt 
Ser Ser Gly Phe Leu Asp Leu Phe Phe Ser His Cys Arg Ser Leu Phe 

90 yD 



85 



aat gac cga aaa tta gta gaa tac tta aag gag agt tct ttt gat gca 
Asa Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala 

100 105 

nta ttt eta aat cct ttt gat acc tgt ggc tta att gtt get aaa tat 
&L Phe Leu Asp Pro Phe Lp Thr Cys Gly Leu lie Val Ala Lys Tyr 

120 125 



ttc tec etc ccc tct gtg gtc ttc acc agg gga ata ttt tgc cac cat 
Phe Ser Leu Pro Ser Val Val Phe Thr Arg Gly lie Phe Cys His His 

135 140 



130 



ctt gaa gaa ggt gca cag tgc cct get cct ctt tec tat gtc ccc aat 
Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Asn 
150 155 



145 



aat etc tta ggg ttc tea gat gec atg act ttc aag gag aga gta tgg 
Sp Leu Leu III Phe Ser Lp Ala Met Thr Phe Lys Glu Arg Val Trp 

aac cac ate gtg cac ttg gag gac cat tta ttt tgc cag tat ctt ttt 
Asn Sis lie Val His Leu Glu Asp His Leu Phe Cys Gin Tyr Leu Phe 
180 185 190 

aga aat gec eta gaa ata gee tct gaa att etc caa acc cct gtc acg 
£rg Asn Ala Leu Glu lie Ala Ser Glu lie Leu Gin Thr Pro Val Thr 
195 200 205 

aca tat qat etc tac agt cac aca tea att tgg ttg ttg cga acg gac 
Ma Tyr Asp Leu Tyr Ser His Thr Ser He . Trp Leu Leu Arg Thr Asp 
210 215 220 

ttt gtt ttg gac tat ccc aaa ccc gtg atg ccc aac atg ate ttc att 
Phe Val Leu L P Tyr Pro Lys Pro Val Met Pro Asn Met lie Phe lie 
225 230 235 

qqt ggt ate aac tgt cat cag gga aag cca ttg cct atg 
Gly Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met 
245 250 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



57 6 



624 



672 



720 



759 
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<210> 16 
<2X1> 253 
<212> PRT 

<213> Homo sapiens 

Met Asp'ciy Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 

i 5 10 _ 

Xle Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 

20 25 ou 

Gin Leu Glu Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
35 40 



Tyr Thr Leu Glu Asp Gin Asn Arg Glu Phe Met Val Phe Ala His Ala 
Gin ?r P Lys Ala Gin Ala Gin Ser lie Phe Ser Leu Leu Met Ser Ser 



Ser Ser Gly Phe Leu Asp Leu Phe Phe Ser His Cys Arg Ser Leu Phe 

85 90 
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala. 

100 105 x± m 

val Phe Leu Asp Pro Phe Asp Thr Cys Gly Leu He Val Ala Lys Tyr 

115 120 1 ** 3 

Phe Ser Leu Pro Ser Val Val Phe Thr Arg Gly lie Phe Cys His Hxs 

Leu III Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Asn 

HI Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp 

165 1^0 . 

Asn His He Val His Leu Glu Asp His Leu Phe Cys Gin Tyr Leu Phe 

180 185 „ m *_ 

Arg Asn Ala Leu Glu lie Ala Ser Glu He Leu Gin Thr Pro Val Thr 

nqs 200 /u:> 

Ala Tyr Asp Leu Tyr Ser His Thr Ser lie Trp Leu Leu Arg Thr Asp 

n i n 215 ^.^u 

Phe III Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met lie Phe lie 
725 230 

Glv Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met 
245 250 

<210> 17 

<211> 735 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) ... (735) 



aaa tt! « gcc tac att aat get tct gga gaa cat gga att gtg gtt 
I" Phe Glu Ala Tyr lie Asn Ala Ser Gly Glu His Gly He Val Val 

ttc tct ttg gga tea atg gtc tea gaa att cca gag aag aaa get atg 
Phe Ser Leu Gly Ser Me? Val Ser Glu He Pro Glu Lys Lys Ala Met 
20 25 

nra att ac t aat act ttg ggc aaa ate'ect cag aca gtc ctg tgg egg 
K lie Ala Asp Ala Leu G?y Lys lie Pro Gin Thr Val Leu Trp Arg 

40 *° 



96 



144 
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tac act gga acc cga cca teg aat ctt gcg aac aac acg at a ctt gtt 
Tyr Thr Gly Thr Arg Pro Ser Asn. Leu Ala Asn Asn Thr lie Leu Val 
50 55 60 

aag tgg eta ccc caa aac gat ctg ctt ggt cac ccg atg acc cgt gec 
Lvs Trp Leu Pro Gin Asn Asp Leu Leu Gly His Pro Met Thr Arg Ala 
65 70 75 80 

ttt ate acc cat get ggt tec cat ggt gtt tat gaa age ata tgc aat 
Phe He Thr His Ala Gly Ser His Gly Val Tyr Glu Ser He Cys Asn 
85 90 95 

ggc gtt ccc atg gtg atg atg ccc ttg ttt ggt gat cag atg gac aat 
Gly Val Pro Met Val Met Met Pro Leu Phe Gly Asp Gin Met Asp Asn 
100 105 HO 

gca aag cgc atg gag act aag gga get gga gtg acc ctg aat gtt ctg 
Ala Lys Arg Met Glu Thr Lys Gly. Ala Gly Val Thr Leu Asn Val Leu 
115 120 125 

gaa atg act tct gaa gat tta gaa aat get eta aaa gca gtc ate aat 
Glu Met Thr Ser Glu Asp Leu Glu Asn Ala Leu Lys Ala Val lie Asn 
130 135 140 

gac aaa agt tac aag gag aac ate atg cgc etc tec age ctt cac aag 
Asp Lys Ser Tyr Lys Glu Asn He Met Arg Leu Ser Ser Leu His Lys 
145 150 155 160 

gac cgc ccg gtg gag ccg ctg gac ctg gec gtg ttc tgg gtg gag ttt 
Asd Arq Pro Val Glu Pro Leu Asp Leu Ala Val Phe Trp Val Glu Phe 
165 170 175 

gtg atg agg cac aag ggc gcg cca cac ctg cgc ccc gca gec cac gac 
Val Met Arg His Lys Gly Ala Pro His Leu Arg Pro Ala Ala His Asp 
180 185 190 

etc acc tgg tac cag tac cat tec ttg gac gtg att ggt ttc etc ttg 
Leu Thr Trp Tyr Gin Tyr His Ser Leu Asp Val He Gly Phe Leu Leu 
195 200 205 

gec gtc gtg ctg aca gtg gec ttc ate acc ttt aaa tgt tgt get tat 
Ala Val Val Leu Thr Val Ala Phe He Thr Phe Lys Cys Cys Ala Tyr 
210 215 220 

ggc tac egg aaa tgc ttg ggg aaa aaa ggg cga gtt aag aaa gee cac 
Gly Tyr Arg Lys Cys Leu Gly Lys Lys Gly Arg Val Lys Lys Ala His 
225 230 235 240 

aaa tec aag acc cat 
Lys Ser Lys Thr His 
245 



PCT/US99/09702 

192 



240 



288 



336 



384 



432 



480 



528 



57 6 



624 



672 



720 



735 



<210> 18 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Glu Phe Glu Ala Tyr He Asn Ala Ser Gly Glu His Gly He Val Val 

15 10 15 

Phe Ser Leu Gly Ser Met Val Ser Glu lie Pro Glu Lys Lys Ala Met 
20 25 30 
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TT, L », «. « «* a- •» «• - n 1 - T - - 

IyI Thr S» Thr Ar a Pro s.r A.„ Leu Ma A,„ A„ Thr 11. L.» v.l 
W5 Trp Leu Pro Sl» As„ Lp Leu L.u Gly His «o Het Thr «9 «• 
£. 11. Thr His Ala Gly ser Hie G.» val Tyr Glu s.r 11. Cys *» 
Gly „1 Pro «.« Si ».t Met Pro Leu Phe Gly M. «. « Aep » 
Ala „y, Ar, i« «. »r Lye «j Km « - - « « - 
ota „et Thr Ser Giu L.u Glu « Al. L.u Ly. Al. val n. » 
„ S 3« Tyr Lye O. » « «1 J- - «« »» " 1S 3S 
JS Arg Pro Val ». Pro Leu Asp L.u M. Val Ph. Trp V.l Glu Ph. 

v,i „et «, »i= £ «i» «■ « «g - p " ila a 

L- Thr T,p Tyr Gl» T,r His S.r L.u Asp Val .1. «» - « - 

Ala val v.! Leu Thr V,l Al, IS. 11. Thr Ph. Lys cys Cys Ala Tyr 
01y Tyr at, Lys Cys Leu II, Lye Lys Gly «, v.l Lys Lye Al. Hi. 



225 

Lys Ser Lys Thr His 
245 



<210> 19 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 



<400> 19 
tggtgtatcg attggtttt 

<210> 20 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 



<400> 20 
catatatctg gggctagtta ate 

<210> 21 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 

<4 00> 21 
:aaggtaat taagatgaag aaagca 



<210> 22 
<211> 20 
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<212> DNA 

<213> Artificial Sequence" 
<220> 

<223> Primer 

<400> 22 
acctgagata gtggcttcct 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 23 
tttgtcttcc aattacatgc 

<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 24 
agtagatatg gaagcacttg taag 

<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 25 
tctcagtgac aaggtaatta agac 

<210> 26 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 26 
cattgattgg ataaaggca 

<210> 27 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 27 
aatttgggtt cttacatatc aa 



-20- 



PCT/US99/09702 

WO 99/57322 

<210> 28 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 28 
gagtgaggga ggacagag 

<210> 29 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 29 
ataagtacac gccttctttt g 

<210> 30 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 30 
gctgctttat acaatttgct ac 

<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 31 
cgcctacgta tcatagcagt ta 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 32 
ggaaagaaat ttgaaatgca ac 

<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 33 
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tctttccgcc tactgtatca 

<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 34 
ttcaagaagg gcagttttat 

<210> 35 
<211> 17 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 35 
ctctggcagg agcaaag 

<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 36 
atacacacct gggatagtgg 

<210> 37 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 37 
ggtaattaag atgaagaaag ca 

<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 38 
ctgagatagt ggcttcctg 

<210> 39 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 39 
gtggctcaat gacaagg 

<210> 40 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 40 
atatggaagc acttgtaagt aaa 

<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 41 
ttaagacgaa ggaaacaatt ct 

<210> 42 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 42 
acctgagata gtggcttcc 

<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 43 
atcaaagggt aaaattcaga 

<210> 44 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 44 
ggcagtccaa aagaaata 

<210> 45 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 

<400> 45 
ttttgagggc aggttcta 

<210> 46 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 46 
aatgggacaa atgtaaatga ta 

<210> 47 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 47 
ttctctcatg gctcgca 

<210> 48 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 48 
atgtcaaatc acaattcagt aagg 

<210> 49 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 49 
ccgcctactg tatcatagca 

<210> 50 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 50 
caacgaaatg tcaaatcaca g 

<210> 51 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 

<400> 51 
ctctggcagg agcaaag 

<210> 52 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 52 
acagtgggca gagacag 

<210> 53 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 53 
gtggtttatt ccccgtat 

<210> 54 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 54 
atacacacct gggatagtgg 

<210> 55 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 55 
ggtaattaag atgaagaaag ca 

<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 56 
gaaatggcat aggttgtc 

<210> 57 
<211> 17 
<212> DNA 
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<213> Artificial Sequence 



PCT/US99/09702 



<220> 

<223> Primer 

<400> 57 
ggccacactc aactgta 

<210> 58 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 58 
ctcaaaaaaa acacagtagg 

<210> 59 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 59 
actttttctg ccccttat 

<210> 60 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 60 
atatggaagc acttgtaagt aaa 

<210> 61 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 61 
ttaagacgaa ggaaacaatt ct 

<210> 62 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<40O> 62 
aatggcatac gttgtca 

<210> 63 
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<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 63 
agaatggcaa ttatgaaca 

<210> 64 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 64 
tgtgtgccct taaagtct 

<210> 65 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 65 
agaatggcaa ttatgaaca 

<210> 66 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 66 
acctgagata gtggcttcc 

<210> 67 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 67 
ctctggctct gtcctac 

<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 68 
acctgagata gtggcttcc 
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19 



18 



19 
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<210> 69 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> €9 
atcaaagggt aaaattcaga 

<210> 70 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 70 
cagcagcttg tcacctac 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 71 
aatttgcttt tgaaagaatc 

<210> 72 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 72 
ggtaggccca aatactca 

<210> 73 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 73 
aatttgcttt tgaaagaatc 

<210> 74 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 74 
ggcagtccaa aagaaata 

<210> 75 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 75 
ttttgagggc aggttcta 

<210> 76 
<2U> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 76 
cacctctggc atgactac 

<210> 77 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 77 
ttgcaggagt ttgtttaat 

<210> 78 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 78 
aatgggacaa atgtaaatga ta 

<210> 79 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 79 
cattgcagga gtttgttta 

<210> 80 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 

<400> 80 
catctgagaa ccctaagaga 

<210> 81 
<211> 20 
<2X2> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 81 
agaaatagcc tctgaaattc 

<210> 82 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 82 
atgtcaaatc acaattcagt aagg 

<210> 83 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 83 
ccgcctactg tatcatagca 

<210> 84 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 84 
gagtgtacga ggttgagtaa g 

<210> 85 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 85 
attttgccag tatcttttta g 

<210> 86 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 

<400> 86 
caacgaaatg tcaaatcaca g 

<210> 87 

<211> 27 ' 

<212> DNA 

<213> Homo sapiens 

<400> 87 
catcagagac agagcatttt acacctt 

<210> 88 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 88 
ggacctattg agccctgcat ctgtct 

<210> 89 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 89 
ggttcccctg ccgcggctgg ccaca 

<210> 90 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 90 
gccctgggct gaaagtggaa ag 

<210> 91 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 91 
atgcgggagg ccttgcggga get 

<210> 92 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 92 
ctctgcgcgg cggtgctggc taag 

<210> 93 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 93 
taccccaggc caatcatgcc caaca 

<210> 94 
<211> 27 
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<212> DNA 

<213> Homo sapiens 

<400> 94 
tccaggcaaa atacttttta aaaaatg 

<210> 95 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 95 
agcatgcggg aggcctcgcg gga 

<210> 96 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 96 
gcgggagctc catgcgagag g 

<210> 97 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 97 
tggtggtcct caccccggag gtgaa 

<210> 98 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 98 
tacatcaaag aggagaactt tttcac 

<210> 99 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 99 
tgatcaggca cctgaatgct acttcc 

<210> 100 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 100 
acctctgcgg ggcggtgctg g 

<210> 101 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 101 
aagaacatgc tttaccctct ggc 

<210> 102 
<211> 18 
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<212> DNA 

<213> Homo sapiens 

<400> 102 
ctctggctct gtcctacc 

<210> 103 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 103 
tcctaccttt gctatgctgt ttct 

<210> 104 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 104 
tgtcagtggt ggatatt 

<210> 105 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 105 
ggtggatatt ctcagc 

<210> 106 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 106 
tcagctatgc ate 

<210> 107 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 107 
gcatccgtgt ggctgttccg a 

<210> 108 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 108 
tggctgttcc gaegggaett 

<210> 109 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 109 
gggacttcgt gatgga 

<210> 110 
<211> 23 
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<212> DNA 

<213> Homo sapiens 

<400> 1X0 
gtgatggact accccaggcc gat 

<210> 111 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 111 
cctgcctcct tcgcgcattt cagag 

<210> 112 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 112 
gcgatcattc ctgactgctc ctcag 

<210> 113 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 113 
ccctggagca tgcattcagc ag 

<210> 114 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 114 
cattcagcag cagcccagac cct 

<210> 115 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 115 
tacttcttcc acgtactata tta 

<210> 116 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 116 
ggcctccttc cactatatgt gtgt 

<210> 117 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 117 
ggagagagta cggaaccaca t 

<210> 118 
<211> 23 
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<212> DNA 

<213> Homo sapiens 

<400> 118 
tcaatttggt tattgcgaac tga 

<210> 119 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 119 
caggggaata gcttgccact at 

<210> 120 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 120 
tgttgcgaac ggactttgtt ttgg 

<210> 121 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 121 
ttcaccagca atcggtggtg g 

<210> 122 
<211> 25 
<212> DNA 

<213> Homo sapiens 

<400> 122 
ctagaaatag cttctgaaat tctcc 

<210> 123 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 123 
cggcatatga tatctacagt caca 

<210> 124 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 124 
tcaatttggt tgctgcgaac aggac 
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